Film strip interface for interactive content

ABSTRACT

Features are disclosed for dynamically generating a navigation interface for rendering content related to an item. To generate the interface, the system analyzes the content available and organizes the content in an intuitive way that keeps the user oriented as they explore the product. Some aspects also consider the access device rendering the interface to ensure the interface includes content that is appropriately selected and formatted for presentation via the access device.

BACKGROUND

Retailers, merchants, sellers, manufacturers, stylists, designers, and others may provide or promote user interfaces to enable a user to browse items included in an electronic catalog and/or search an electronic catalog for items available for purchase or other acquisition. For example, the user may browse or scroll through a grid or list of items. Content may be provided for the items to help describe or illustrate features of the item. One example of content that may be provided is an image of the item. The image may show the entire item or a portion of the item. By including several images of the item, different perspectives of the item may be presented. For example, for a clothing item, it may be desirable to include a first image showing one side of the item, a second image showing the reverse side of the item, and a third image showing the label for the clothing item. In presenting these different images, however, there exists a problem in maintaining context between the views presented in each image and how the views are related.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of various inventive features will now be described with reference to the following drawings. Throughout the drawings, the examples shown may re-use reference numbers to indicate correspondence between referenced elements. The drawings are provided to illustrate example embodiments described herein and are not intended to limit the scope of the disclosure.

FIG. 1 is a pictorial diagram depicting an illustrative environment in which a content enhancement engine is used to augment content describing an item that is accessed from a catalog system using an access device.

FIG. 2 is a block diagram depicting illustrative components of an embodiment of the access device shown in FIG. 1.

FIG. 3 is a block diagram depicting illustrative components of an embodiment of a navigation generator implemented by the content enhancement engine shown in FIG. 1.

FIG. 4 is a pictorial diagram of an example of content generated by the navigation engine of FIG. 3 and displayed by the access device.

FIG. 5 is a flow diagram depicting an illustrative routine executed by the navigation generator for generating content for an item.

FIG. 6 is a pictorial diagram of a hierarchy generated by the navigation engine that maps content elements to locations in content such as an overview image.

FIG. 7 is a block diagram depicting illustrative components of an embodiment of a content hint generator implemented by the content enhancement engine shown in FIG. 1.

FIG. 8 is a pictorial diagram showing an example of a content hint generated by the content hint generator and displayed in conjunction with content for an item.

FIG. 9 is a flow diagram depicting an illustrative routine executed by the content hint generator for generating an enhanced content hint.

FIG. 10 is a pictorial diagram showing how a location for placement of an enhanced content hint may be detected by the content hint generator shown in FIG. 7.

FIG. 11 is a pictorial diagram showing how an interaction direction indicated by an enhanced content hint may be detected by the content hint generator as part of generating an enhanced content hint using an optical flow of pixels.

FIG. 12 is a block diagram depicting illustrative components of an embodiment of a physics engine implemented by the content enhancement engine shown in FIG. 1.

FIG. 13 is a flow diagram depicting an illustrative routine executed by the physics engine for generating a physics model for a physics property of an item represented in the content.

FIG. 14 is a graph diagram depicting an example waveform of audio content related to the item and thresholds for haptic feedback that may be used by the physics engine to generate the physics model.

FIG. 15 is a flow diagram depicting an illustrative routine executed by the physics engine for mapping interactions to portions of the content.

FIG. 16 is a schematic diagram of messages that may be sent between components of the content enhancement engine during an example item browsing session.

FIG. 17 is a block diagram depicting an illustrative computing device that can implement the content enhancement engine shown in FIG. 1.

DETAILED DESCRIPTION

An item may be described using different types of content such as image, video, audio, or text. Textual content may include descriptions of the item for display. Video content may include a series of images demonstrating a feature of the item. For example, for a kitchen gadget, it may be desirable to show a video of a person using the gadget. The video may include audio coordinated for playback with the series of images. For some items, the audio content may include a recording of an oral description of the item or its features. For some items, the audio content may include a recording of a sound the item makes. For example, an alarm clock may have audio content of the alarm sound emitted by the alarm clock.

Content for items is typically shown via a display of a computing device such as a smartphone, tablet computer, laptop, or desktop computer. However, the display and/or computing device may be limited in the ways in which the item can be described. Providing an accurate representation of the item can thus be hindered by the limitations in the display capabilities of the device. For example, the contextual relationship between content elements can be lost or obscured. Consider the relationship between an overview image of an item and a detailed image of the item. For intricate items, such as a rug, it may be difficult to understand what part of the rug is shown in the detailed view. Hinting, as described below, may be included to associate the detailed content with a location (e.g., region) within the overview image.

Furthermore, in some implementations, content may be received from multiple sources. For example, users may provide different content for an item. Some content may show the item from the same or similar perspective, while other content may show unique features of the item. In a limited resource environment (e.g., a mobile device, small form factor device, etc.), it may be desirable to dynamically identify the content, and the layout of that content, to be displayed. This can provide the appropriate content to a user, in a contextually appropriate way, so that the item is described accurately and efficiently.

Such interfaces may be considered enhanced content because the content elements are augmented through the dynamic generation of the presentation of content showing the item. Enhanced content may refer to interactive content that provides descriptive information about an item. The enhanced content may include an aggregation of content elements that are coordinated in time and/or space to provide a unified representation of the item. In some instances, the enhanced nature of the content may not be apparent when accessing the enhanced content via a computing device. For example, which locations of the enhanced content are interactive may not be readily apparent. Furthermore, the ways in which the enhanced content may be interacted with may not be apparent. Accordingly, it may be desirable to process enhanced content to provide guidance (e.g., hints) as to how to interact with the enhanced content. One enhancement to the content may be dynamically providing an interface for navigating the content available for an item. For example, images showing a detailed view of an item may be dynamically selected and mapped to a location or region in an overview image to indicate which portion of the overview image the detailed view is showing and to indicate, in the overview image, which locations have detailed views available.

Enhanced content may also include enhancing existing content such as video or audio content to include interactive descriptive information. For example, video content may be enhanced to respond to user interactions (e.g., touch, sound, breath, or other input) with the display showing the enhanced content to simulate a physics property of the item. For example, if a user is moving their finger on enhanced video content of a zipper being zipped to move the zipper head, playback of audio and haptics (using, for example, the actuator or vibration motor in a phone) may be controlled to correspond to the sound and vibration of the teeth as the zipper is pulled. The enhanced content may include generating a physics model to correlate the interactions of the user (e.g., finger movements) in time or space with the playback of audio, video, haptics, or other responses.

The features described provide a non-limiting advantage of enhancing the presentation of an item on a display so as to provide a realistic and accurate representation of the item. Rather than simply presenting one image of an item, images, videos, and enhanced content may be presented. For example, a first image showing the item may be presented in a main viewing portion on a display, and a film strip of thumbnail images may be rendered near the main viewing area. The thumbnail images may represent previews of other content (e.g., other images, videos, audio, or enhanced content) available for display. The film strip can include a central frame, and the thumbnail images can be moved relative to the central frame in response to an interaction with the film strip. One example of an interaction with the film strip is dragging the film strip back and forth while the central frame remains fixed in order to position any one of the thumbnail images within the central frame.

In addition to providing previews of content available for display for a given item, if a thumbnail represents a point of interest within the image shown in the main viewing portion, an area surrounding the point of interest can be noted on the image shown in the main viewing portion. When the thumbnail corresponding to one of these points of interest is within the frame of the film strip, the corresponding area in the main image can be highlighted to indicate which point of interest is in focus. Further, a visual or animated hint can be displayed within the highlighted area when the corresponding thumbnail is in focus to provide an indication of the point of interest shown in the thumbnail and/or that the type of content associated with the thumbnail is enhanced content. This helps orient and relate different content showing the item represented in the film strip to the content shown in the main viewing portion.

If a user selects a thumbnail (e.g., by touch or click), the content corresponding to the thumbnail can be presented in the main viewing portion. Depending on a content type (e.g., image content, video content, textual content, audio content, enhanced content), presentation in the main viewing portion may include initiating playback of the content corresponding to the selected thumbnail (such as by playing video or audio content). As the content corresponding to the selected thumbnail comes into focus in the main viewing portion, the presentation may include zooming into the point of interest within the main viewing portion. This highlights the region of the content corresponding to the selected thumbnail relative to the content previously shown in the main viewing portion and helps maintain context for overall item review.

The selected content may show a detailed portion of the item. After viewing content showing a detailed portion of the item, a user may wish to return to content showing an overview of the item. This may be accomplished by detecting a selection of a thumbnail corresponding to the content showing the overview of the item from the film strip. Alternatively, a control element may be provided such as a back-arrow. If the control element is activated, the content showing the overview of the item may be presented in the main viewing portion, replacing the content showing a detailed portion of the item. The presentation may include zooming out from the point of interest shown in the content showing a detailed portion of the item to the content showing the overview of the item. This again maintains a navigational context for the interface and orients the user to the features of the item as illustrated in the content showing the detailed portion of the item and the content showing the overview of the item in a more realistic and intuitive way.

Content showing the item may be enhanced content. Content identified as enhanced content may be configured to respond to a detected interaction. For example, the functioning of a zipper on a garment may be demonstrated using interactive content. In such interactive content, a series of images (e.g., video) of the zipping action may be captured. The frames of the video may be mapped to an interaction with the zipper such that if a user selects the zipper tab and then drags along the zipper, the images shown correlate to the unzipped state of the garment.

For a given item, it may not be readily apparent which content is enhanced content or in what ways a user can interact with enhanced content. The enhanced content may be dynamically augmented with hints for presentation to a user. The hints may identify the content as enhanced content and suggest how the enhanced content may be activated and/or interacted with. The introduction of hints into an initial preview of content may suggest the presence of enhanced content upon user interaction. The hints may be presented to show a user what action they can take to further explore the content, and/or a preview of what will happen if they do.

Hints can be auditory or visual, such as “veils” or “ghosts.” With a veil, a shadow may be displayed that passes across the image to hint at an action that can be made by user interaction. With a “ghost,” a ghost image/video may be presented to preview what will happen when the user interacts with the image. In some implementations, a combination of these hints may be included based on the desired description of the item, the capabilities of the access device for displaying the hints, and/or the ability of the user of the access device to perceive the hints (e.g., due to visual impairments, color vision impairment, or hearing impairment).

Enhanced content can respond to the user's touch, sound, breath, or other input while the digital representation of the item is being handled, or after the user releases the interaction. For example, if a user is moving their finger on a video of a zipper being zipped to move the zipper head, playback of audio and haptics (using, for example, the actuator or vibration motor in a phone) may be controlled to correspond to the sound and vibration of the teeth as the zipper is pulled. Sound, haptics, or visual hints can be included in a physics model of the item to mimic the feel of the zipper, including whether it catches on the fabric or is difficult to pull. Physics can also be mimicked when the user lifts their finger/mouse from one of the interactions to increase realism. With the zipper example, if the user lifts their finger after sliding it partially across the span of movement, the video is paused at the corresponding frame, displaying the zipper half-open. This simulates the physical property of isotonicity for the item.

In other situations, e.g., when a user uses their fingers to manipulate enhanced video content demonstrating the elasticity of a fabric, releasing the engagement with the enhanced video content causes the video to play back to the original frame, providing a simulation of the fabric's shape returning to its unstretched state. This simulates the physical property of elasticity for the item.
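
For purposes of illustration only, the following Python sketch shows one way the drag-to-frame mapping and the two release behaviors described above might be expressed. The function names (map_drag_to_frame, on_release) and the behavior labels are hypothetical assumptions made for this sketch and are not taken from the disclosed embodiments.

    # Illustrative sketch only; names and behaviors are hypothetical.
    def map_drag_to_frame(drag_fraction, total_frames):
        """Map a drag position (0.0-1.0 along the interaction path) to a frame index."""
        drag_fraction = max(0.0, min(1.0, drag_fraction))
        return int(round(drag_fraction * (total_frames - 1)))

    def on_release(current_frame, physics_property):
        """Decide which frame to show after the user lifts their finger."""
        if physics_property == "isotonic":
            # e.g., a zipper: hold the frame where the drag stopped.
            return current_frame
        if physics_property == "elastic":
            # e.g., stretched fabric: play back to the original, unstretched frame.
            return 0
        return current_frame

    # Example: a 120-frame zipper video, released halfway along the zipper.
    frame = map_drag_to_frame(0.5, 120)            # approximately half-open
    resting_frame = on_release(frame, "isotonic")  # stays at that frame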

These simulated isotonic or elastic reactions can be mapped closely to the physical properties of the material or situation through generative processes or through capture methods. For example, in a video where a hand moves across the bristles of a brush, the audio waveform can be used as a proxy for a haptic response corresponding to the ‘feel’ of the bristles. As each bristle is engaged, the waveform of the audio is transformed to a corresponding pulse to the vibration motor of the phone, which makes it “feel” as if the bristles are being stroked.
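
A minimal sketch of the waveform-to-haptics mapping is shown below. It assumes the audio samples are already available as a normalized NumPy array and uses a simple amplitude threshold per analysis frame to emit vibration pulses; the threshold value, frame length, and pulse format are illustrative assumptions rather than the physics engine's actual processing.

    import numpy as np

    def waveform_to_haptic_pulses(samples, sample_rate, threshold=0.3, frame_ms=10):
        """Convert a normalized waveform (-1.0..1.0) into (time_offset_s, intensity) pulses.

        Illustrative only: each analysis frame whose peak amplitude exceeds the
        threshold produces a pulse that could be sent to a vibration motor.
        """
        frame_len = max(1, int(sample_rate * frame_ms / 1000))
        pulses = []
        for start in range(0, len(samples), frame_len):
            frame = samples[start:start + frame_len]
            peak = float(np.max(np.abs(frame)))
            if peak >= threshold:
                pulses.append((start / sample_rate, peak))
        return pulses

    # Example: a synthetic "bristle" click train sampled at 22.05 kHz.
    t = np.linspace(0, 1.0, 22050, endpoint=False)
    audio = 0.8 * np.sin(2 * np.pi * 440 * t) * (np.sin(2 * np.pi * 8 * t) > 0.9)
    print(waveform_to_haptic_pulses(audio, 22050)[:3])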

The physics modeling feature provides a mapping of the real-world interaction response to a digital proxy, which a user's brain “fills in” via mirror neurons or vicarious observation to provide a more immersive experience of the item.

FIG. 1 is a pictorial diagram depicting an illustrative environment 100 in which a content enhancement engine 1700 is used to augment content describing an item that is accessed from a catalog system 110 using an access device 200. The environment 100 shown in FIG. 1 includes an access device 200. The access device 200 may be configured to exchange messages with the catalog system 110. The messages may include queries for items or requests for content for an item. The access device 200 may be implemented as an appliance, a video conferencing system, personal computing device, laptop computing device, hand held computing device, terminal computing device, mobile device (e.g., mobile phones or tablet computing devices), wearable device configured with network access and program execution capabilities (e.g., “smart eyewear” or “smart watches”), wireless device, electronic reader, media player, home entertainment system, gaming console, set-top box, television configured with network access and program execution capabilities (e.g., “smart TVs”), a combination of one or more of the foregoing, or some other electronic device or appliance. The access device 200 may transmit messages to and receive messages from the catalog system 110 and/or the content enhancement engine 1700. The messages may be formatted according to a standardized protocol such as TCP/IP, HTTP, FTP, or the like. The transmission may be sent over wired, wireless, or hybrid wired-wireless networks.

The access device 200 may be configured to provide a display 400 of content received from the catalog system 110 or from the content enhancement engine 1700. In some implementations, the display 400 may be an interactive display such as a touchscreen or a display coupled with an input device such as a mouse, keyboard, stylus, or the like. In such implementations, the display 400 may also detect interactions at one or more points on the display 400. An interaction may include location information indicating where the interaction was detected. An interaction may include an interaction type such as a click, double-click, identification of which button was clicked, a tap, a gesture, etc. An interaction may also include a direction. For example, a user may click and drag from left to right on the display 400. In such implementations, the interaction may include information indicating left to right movement for the interaction. An interaction may also include a magnitude indicating an amount of interaction. For example, when the interaction is a swipe, the magnitude as indicated by the length of the swipe may be measured and provided as part of the interaction detection. As another example, the magnitude may be measured based on how quickly the swipe is made or how much pressure was applied during the swipe. It will be appreciated that the magnitude may be detected using a composite of one or more of direction, speed, pressure, and duration of interaction.
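
One possible way to represent such an interaction, and to derive a composite magnitude from its length, speed, and pressure, is sketched below. The field names and the weighting are assumptions made for illustration and do not describe how the display 400 or the access device 200 actually reports interactions.

    import math
    from dataclasses import dataclass

    @dataclass
    class Interaction:
        """Illustrative interaction record; field names are hypothetical."""
        kind: str            # e.g., "tap", "swipe", "click"
        start: tuple         # (x, y) where the interaction began
        end: tuple           # (x, y) where the interaction ended
        duration_s: float    # how long the interaction lasted
        pressure: float      # normalized 0.0-1.0, if the display reports it

        def direction(self):
            dx, dy = self.end[0] - self.start[0], self.end[1] - self.start[1]
            return math.degrees(math.atan2(dy, dx))

        def magnitude(self):
            """Composite of length, speed, and pressure (illustrative weighting)."""
            length = math.hypot(self.end[0] - self.start[0], self.end[1] - self.start[1])
            speed = length / self.duration_s if self.duration_s > 0 else 0.0
            return 0.5 * length + 0.3 * speed + 0.2 * (self.pressure * 100)

    swipe = Interaction("swipe", start=(10, 300), end=(210, 300), duration_s=0.25, pressure=0.6)
    print(swipe.direction(), swipe.magnitude())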

The access device 200 may be configured to provide other types of output than visual output. As shown in FIG. 1, the access device 200 may be coupled with an audio output device 102. The audio output device 102 may be implemented as a speaker. The audio output device 102 may be configured to play sounds included in the item content. For example, if the item is an alarm clock, it may be desirable to provide users access to audio content representing the sound of the alarm. As another example, if the content is video content, there may be both audio and visual content. The audio output device 102 may be used to present the audio portion of the video content.

Another type of output the access device 200 may provide is haptic feedback. As shown in FIG. 1, the access device 200 may be coupled with a haptic output device 104. The haptic output device 104 may provide tactile feedback such as vibrations to a user of the access device 200. One example of a haptic output device 104 is a rotary vibrator. The haptic output device 104 may be configured to provide varying degrees of vibrations (e.g., over a range of frequencies and/or intensities) based on information provided by the access device 200. Other output devices that may be coupled or integrated with the access device 200 include an augmented or virtual reality device (e.g., goggles), a tele-immersion device, or a smell generator. In some implementations, the output devices may be included within a physical item such as a chair, dashboard, desk, bed, or pillow.

The access device 200 may also communicate via the network 108 with a catalog system 110. The catalog system 110 may be included to provide a central repository of items that can be accessed by a user. The catalog system 110 may provide functionality for users to, e.g., interactively browse, search for, review, rate, tag, purchase, and sell catalog items such as appliances, electronics, books, clothing, jewelry, and the like. Examples of such systems are described in greater detail, e.g., in U.S. Pat. Nos. 7,685,074; 7,472,077; and 8,122,020, each of which is incorporated herein by reference in its entirety.

The access device 200 may be configured to request information from the catalog system 110. For example, the access device 200 may submit a query to the catalog system 110. A search engine 122 may process the query to identify items and/or content describing an item from an item data store 126. The identification may be a literal identification (e.g., exact match between item data and the item search request). The identification may be a partial match (e.g., words with similar stems or synonyms matching). In some implementations, the search may be executed against an index of the item data store 126.

The catalog system 110 may also include a presentation engine 124. The presentation engine 124 may be configured to present items included in the item data store 126 based on information included in the item request. In some implementations, the catalog system 110 and the access device 200 may establish a session. A session generally refers to a series of queries and responses between the catalog system 110 and the access device 200. The session can provide implicit information for an item request. For example, if two item queries for strategy board games are received by the catalog system 110 during a session, it can be inferred that the category of items the user is looking for is games. This information can be used to augment queries at the search engine 122, such as by adding the implicitly identified criteria to searches. This information can also be used to filter or organize items for presentation by the presentation engine 124. For example, if items in the category of board games were returned for a first search, a second search for “hammer of the scots” would rank board game related items more highly than books or movies with the same or similar title. The augmentation may also exclude items for a search from unrelated categories. Using the “hammer of the scots” example, items in the category of hardware may be excluded from the search. The relationship between categories may be identified using natural language models, dictionaries, or some combination thereof.
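
The session-based augmentation could be approximated with a simple re-ranking step such as the following sketch; the result structure, category labels, and boost factor are illustrative assumptions and do not describe the actual behavior of the search engine 122 or the presentation engine 124.

    from collections import Counter

    def rerank_with_session(results, session_categories):
        """Boost results whose category matches categories seen earlier in the session.

        `results` is a list of dicts such as {"title": ..., "category": ..., "score": ...};
        the structure and the boost factor are illustrative assumptions.
        """
        category_counts = Counter(session_categories)
        reranked = []
        for item in results:
            boost = 1.0 + 0.5 * category_counts.get(item["category"], 0)
            reranked.append({**item, "score": item["score"] * boost})
        return sorted(reranked, key=lambda item: item["score"], reverse=True)

    session = ["board games", "board games"]  # two prior strategy-game queries
    results = [
        {"title": "Hammer of the Scots (board game)", "category": "board games", "score": 0.70},
        {"title": "Hammer of the Scots (novel)", "category": "books", "score": 0.75},
    ]
    print(rerank_with_session(results, session)[0]["title"])  # the board game ranks first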

To improve the interface and provide a more realistic and accurate presentation of the item, the content enhancement engine 1700 may be included. As shown in FIG. 1, the content enhancement engine 1700 is a device in communication with the catalog system 110 and the access device 200 via the network 108. In some implementations, all or portions of the content enhancement engine 1700 may be implemented within the catalog system 110 or the access device 200. The content enhancement engine 1700 may receive content for an item to be enhanced. One enhancement may be to provide a navigation interface to present the content for an item in a contextually accurate way.

A navigation generator 300 may be included to analyze the content for an item and generate enhanced content including, for example, the navigation interface. Another enhancement may be to provide hints about content. A content hint generator 600 may be included to identify the type of content received from the catalog system 110 and identify relationships between the content. For example, a first content element may be an image showing a wristwatch and a second content element may be a video showing a close up of the watch face. The second content element is related to a portion of the wristwatch (e.g., the watch face) shown in the first content element. The hints can indicate interactive features for enhanced content or that additional content is available for a specific location of displayed content.

Another enhancement may be generating a physics model for content. The physics model may be generated to provide a representation of a physical characteristic of the item using image, sound, haptics, and/or other perceivable feedback presented via the access device 200. A physics engine 1100 may be included to generate one or more physics models for content. In some implementations, the physics engine 1100 may also tailor the model to the specific feedback elements (e.g., display, audio, haptic, etc.) for the access device that will render the enhanced content.

The system 100 may include a content enhancement workstation 170. The content enhancement workstation 170 may be configured to receive item content and acquire enhancements for the item content. For example, some features may not be detectable by the content enhancement engine 1700. Such features may include a physics property of an item, identifying the item represented by the item content, or interactive features of enhanced content. The content enhancement workstation may present an interface to receive annotation information for the item content such as areas of interest, features, or the like. The annotation information may be used by the content enhancement engine 1700 to process the item content (e.g., generate enhanced content).

In some implementations, the content enhancement workstation 170 may be coupled with an item physics sensor 175. As shown in FIG. 1, the content enhancement workstation 170 is coupled with one item physics sensor. In some implementations, it may be desirable to include more than one item physics sensor to collect different physics properties. The content enhancement workstation 170 may be configured to transmit control messages to the item physics sensor 175 to detect a physics property of an item. For example, the item physics sensor 175 may be implemented as a light detector. In such implementations, the light detector may be used to collect optical properties for an item such as absorbance, photosensitivity, reflectivity, refraction/refractive index, scattering, or transmittance. The sensing by the light detector may be controlled via a message transmitted from the content enhancement workstation 170 to the light detector. The message may include which optical properties to collect, a quality of light to detect (e.g., color, hue, brightness, etc.), or the format of the data collection (e.g., resolution for the detection, number of bits to use to represent the detected values, etc.). In some implementations, the item physics sensor 175 may be implemented as a magnetometer configured to detect magnetic properties of an item such as diamagnetism or magnetism. In some implementations, the item physics sensor 175 may be implemented as an acoustics sensor (e.g., microphone and audio analysis components) configured to detect acoustic properties of an item such as absorption, reflection, and amplification. In some implementations, the item physics sensor 175 may be implemented as a device or system configured to detect a physical property of the item such as elasticity, malleability (compression), ductility (stretching), strength (the opposite of deformability), stiffness/flexibility (bendability), viscosity (liquid flow), viscoelasticity (rubber bands/silicone products), or toughness (resistance to fracture). The item physics sensor 175 may be implemented as a device or system configured to sense thermal properties of an item such as boiling point, melting point, thermal expansion, emissivity, flammability, flash point, auto-ignition temperature, fire point, glass-transition temperature, heat capacity, or thermal conductivity.

While the access device 200, the content enhancement engine 1700, and the catalog system 110 are shown in FIG. 1 as separate entities configured to communicate via the network 108, in some implementations, it may be desirable to integrate two or more of the systems into a single system. For example, it may be desirable to include the content enhancement engine 1700 in the access device 200 or the catalog system 110.

FIG. 2 is a block diagram depicting illustrative components of an embodiment of the access device shown in FIG. 1. The access device 200 shown in FIG. 2 provides a detailed view of one implementation of the structures which may be included in an access device to provide at least some of the described features. The access device 200 shown in FIG. 2 is a simplified example in which extraneous elements have been omitted to focus the reader on the specific features discussed. Additional elements may be added without departing from the intended scope of what constitutes an access device.

A processor unit 240 may be included to coordinate the functions of the access device 200. For example, the processor unit 240 may be configured to coordinate the process of receiving requests for content (e.g., query parameters such as keywords) and presenting one or more interfaces including the requested content.

The access device 200 shown in FIG. 2 includes a network input/output interface (“network I/O”) 235. The network I/O 235 is configured to transmit and receive communications via a network such as the Internet, a local area network, a wide area network, a peer-to-peer network, a cellular network, a satellite network, a public switched telephone network, any combination thereof, or other communication system for exchanging data between electronic devices. In one implementation, the network I/O 235 is configured to transmit request messages for item data and receive content (including enhanced content) in response, as will be described in further detail below.

As content for an item is received by the access device 200, an item asset manager 220 may be configured to organize the received content. This can include directing the storage of the received content in the memory 250. In the implementation shown in FIG. 2, the item asset manager 220 is implemented as instructions residing in a memory 250 that, when executed, cause the access device 200 to perform the functions described. In some implementations, the item asset manager 220 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The item asset manager 220 may manage the removal of content from the memory 250. For example, if a session is terminated, the item asset manager 220 may remove content obtained during the session or another previous session. For example, a threshold number of sessions' data to preserve may be defined, such as via an item processing configuration. Once the number is exceeded, the content from the oldest session is removed. In some implementations, the threshold may be specified as a quantity of memory. When the quantity of memory used by content exceeds the predetermined threshold, the item asset manager 220 may remove content, starting with the oldest content as measured between a time the content was received and the current time.
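
The memory-threshold eviction policy described above might be sketched as follows. The class and field names are assumptions made for illustration; the item asset manager 220 is not limited to this approach.

    import time

    class ContentCache:
        """Illustrative oldest-first eviction keyed on a memory budget (bytes)."""

        def __init__(self, max_bytes):
            self.max_bytes = max_bytes
            self.entries = []  # list of (received_at, content_id, size_bytes)

        def add(self, content_id, size_bytes):
            self.entries.append((time.time(), content_id, size_bytes))
            self._evict_if_needed()

        def _evict_if_needed(self):
            # Remove the oldest content first until the total fits in the budget.
            self.entries.sort(key=lambda entry: entry[0])
            while sum(size for _, _, size in self.entries) > self.max_bytes:
                self.entries.pop(0)

    cache = ContentCache(max_bytes=1_000_000)
    cache.add("overview-image", 600_000)
    cache.add("zipper-video", 700_000)   # evicts the older overview image
    print([content_id for _, content_id, _ in cache.entries])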

The access device 200 may include an interface generator 210 configured to cause display of a graphical interface showing at least a portion of the received content. In the implementation shown in FIG. 2, the interface generator 210 is implemented as instructions residing in the memory 250 that, when executed, cause the access device 200 to perform the functions described. In some implementations, the interface generator 210 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The interface generator 210 may provide display instructions that can be used by the display 400 for presenting the content. In some implementations, the interface generated by the interface generator 210 may include one or more control elements configured to detect interactions by the user. As the interface detects the interactions, interaction information may be received by the access device 200. The interaction information may be used by the interface generator 210 to adjust the interface. For example, if a user selects a thumbnail image associated with content showing one view of the item, the selection may cause the interface generator 210 to present an animated transition from content currently shown in a main viewing area to content associated with the selected thumbnail image.

An audio output 285 may be included in the access device 200 to cause presentation of audio content. For example, where the item is a tea kettle, it may be desirable to provide an audio file including the sound the tea kettle makes when water is boiling. In some implementations, the audio output 285 may be provided in conjunction with visual output. For example, a video may include a corresponding audio track for synchronized playback.

Enhanced content, such as content including a physics model, may be received for an item at the access device 200. An item physics engine 230 may be provided to coordinate the presentation of the physical properties identified by the physics model through the devices available to the access device 200 such as a display, a haptic output device 287, or the audio output 285. In the implementation shown in FIG. 2, the item physics engine 230 is implemented as instructions residing in the memory 250 that, when executed, cause the access device 200 to perform the functions described. In some implementations, the item physics engine 230 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The access device 200 may further include a power source 280. The power source 280 may include a wired or wireless source of power sufficient to supply operational power to the elements described. In some implementations, the power source 280 may include a battery or other rechargeable source. In some implementations, the power source 280 may include a solar panel array.

The elements of the access device 200 are coupled by a bus 290. The bus 290 is configured to allow the elements to exchange data (such as the messages described herein) and/or power. In some implementations, parallel busses may be included, one for data and one for power.

In some implementations, the access device 200 may include a non-transitory computer-readable memory configured to store executable instructions. The memory 250 may be configured for this purpose. In such a configuration, the processor unit 240 may be in data communication with the computer-readable memory storing the executable instructions. The processor unit 240 may be programmed by the executable instructions to implement the features described.

As noted, the access device 200 shown in FIG. 2 is a simplified access device and other implementations may include additional features. One or more microphones may be included to capture audio from the user (e.g., utterances or other sounds). A speech processing system may be included. The speech processing system may implement various modules and components for receiving spoken input from a user and determining what the user meant. In some implementations, a speech processing system can include an automatic speech recognition (“ASR”) module that receives audio input of a user utterance and generates one or more likely transcriptions of the utterance. ASR modules typically use an acoustic model and a language model. The acoustic model is used to generate hypotheses regarding which words or subword units (e.g., phonemes) correspond to an utterance based on the acoustic features of the utterance. The language model is used to determine which of the hypotheses generated using the acoustic model is the most likely transcription of the utterance based on lexical features of the language in which the utterance is spoken.

FIG. 3 is a block diagram depicting illustrative components of an embodiment of the navigation generator implemented by the content enhancement engine shown in FIG. 1. The navigation generator 300 shown in FIG. 3 provides a detailed view of one implementation of the structures which may be included in a navigation generator to provide at least some of the described features. The navigation generator 300 shown in FIG. 3 is a simplified example in which extraneous elements have been omitted to focus the reader on the specific features discussed. Additional elements may be added without departing from the intended scope of what constitutes a navigation generator.

A navigation processor unit 340 may be included to coordinate the functions of the navigation generator 300. For example, the navigation processor unit 340 may be configured to coordinate the process of receiving requests for navigation content (e.g., item identifier, content showing the item, etc.) and generating one or more interfaces including the requested content.

The navigation generator 300 shown in FIG. 3 includes a network input/output interface (“network I/O”) 335. The network I/O 335 is configured to transmit and receive communications via a network such as the Internet, a local area network, a wide area network, a peer-to-peer network, a cellular network, a satellite network, a public switched telephone network, any combination thereof, or other communication system for exchanging data between electronic devices. In one implementation, the network I/O 335 is configured to receive content (or identifiers thereof) and requests for enhanced content, transmit enhanced content such as a navigation interface for presentation via the display 400, and communicate request and response messages for access device information that can be used to generate the navigation interface, as will be described in further detail below.

An item manager 310 may be included in the navigation generator 300 to obtain and organize information for an item. For example, for a given item, there may be a variety of content that can be included in the navigation interface. In some implementations, the content may be obtained from the catalog system 110 and/or another content source such as a networked search engine. An asset collector 315 may be included to retrieve content from remote locations and, in some implementations, the cache 395. In the implementation shown in FIG. 3, the item manager 310 and the asset collector 315 are implemented as instructions residing in a memory 350 that, when executed, cause the navigation generator 300 to perform the functions described. In some implementations, one or both of the item manager 310 and the asset collector 315 may be implemented as one or more hardware devices (e.g., one or more circuits) configured to perform the functions described.

Whereas the item manager 310 is configured to retrieve and maintain the content that will be included in a navigation interface for a specified item, an interface generator 320 may be included to generate the specific enhanced content including display instructions for the navigation interface. In the implementation shown in FIG. 3, the interface generator 320 is implemented as instructions residing in the memory 350 that, when executed, cause the navigation generator 300 to perform the functions described. In some implementations, the interface generator 320 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The interface generator 320 may receive a notification message from the item manager 310 when the content for an item has been identified. The interface generator 320 may be configured to generate a general interface for presenting the content. The interface generator 320, in generating the interface, may be configured to generate a preview image (e.g., thumbnail) for a content element. The preview image includes a smaller representation of the item shown in the content. The representation may be smaller in size (e.g., dimension of the image) or in resource requirements to store and/or transmit (e.g., memory and/or bandwidth). Generating the preview image may include one or more of scaling the content for which the preview is being generated, changing the format of the encoding used for the content, compressing the content, or the like. In some implementations, the preview may include a series of images such as a video file, animated image, or HTML video.
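
As one illustration of the scaling and re-encoding steps, the sketch below uses the Pillow imaging library to produce a smaller, recompressed preview; the file names, target size, and quality setting are assumptions, and the interface generator 320 is not limited to this approach.

    from PIL import Image  # Pillow imaging library

    def make_preview(source_path, preview_path, max_size=(160, 160), quality=70):
        """Create a smaller, recompressed preview image (illustrative sketch)."""
        with Image.open(source_path) as img:
            img = img.convert("RGB")   # normalize the encoding
            img.thumbnail(max_size)    # scale down, preserving aspect ratio
            img.save(preview_path, format="JPEG", quality=quality)  # compress

    # Hypothetical file names for illustration.
    make_preview("overview_440.png", "overview_440_thumb.jpg")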

The interface generator 320 may be configured to identify and remove redundant content. For example, a request for content describing an item may be transmitted to the catalog system 110. In response, a set of content may be received. The set of content may include two images that provide the same or substantially similar views of the item. The redundancy between the two images may be detected by generating a hash value for each image and comparing the respective hash values. If the hash values are the same, the images may be deemed duplicates. Another way two images may be identified as redundant is through comparison of pixels between the images. For example, random locations within the images may be sampled. The number of locations may be specified using the interface configuration. The pixel information at the random locations from the respective images may then be compared. If all or a predetermined portion of the locations are correlated (e.g., have similar pixel information), the images may be deemed redundant. The predetermined portion may be specified using the interface configuration. Other or additional redundancy checks may be included such as color profile comparison (e.g., histogram or average color values), object detection, comparison of image masks for the images, and the like.
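
A simplified sketch of the two image redundancy checks described above (file-hash comparison and random pixel sampling) follows; the sample count, per-channel tolerance, and required match fraction are illustrative stand-ins for values that would come from the interface configuration.

    import hashlib
    import random
    from PIL import Image

    def same_hash(path_a, path_b):
        """Exact-duplicate check: compare hashes of the raw file bytes."""
        def digest(path):
            with open(path, "rb") as f:
                return hashlib.sha256(f.read()).hexdigest()
        return digest(path_a) == digest(path_b)

    def likely_redundant(path_a, path_b, samples=50, max_channel_diff=10, required_fraction=0.9):
        """Sample random pixel locations and compare them (illustrative thresholds)."""
        img_a = Image.open(path_a).convert("RGB")
        img_b = Image.open(path_b).convert("RGB").resize(img_a.size)
        width, height = img_a.size
        matches = 0
        for _ in range(samples):
            x, y = random.randrange(width), random.randrange(height)
            pa, pb = img_a.getpixel((x, y)), img_b.getpixel((x, y))
            if all(abs(ca - cb) <= max_channel_diff for ca, cb in zip(pa, pb)):
                matches += 1
        return matches / samples >= required_fraction

    # Hypothetical file names: two candidate views of the same item.
    if same_hash("view_1.jpg", "view_2.jpg") or likely_redundant("view_1.jpg", "view_2.jpg"):
        print("redundant")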

If the content included in the set of content is audio, redundancy may be detected based on comparison of one or more of hash values for the respective audio files, the waveforms of the audio included in the audio files, metadata for the audio file (e.g., title, author, file name, file size, file format, etc.), or the like. In some implementations, a transcription of the audio files may be obtained such as via an automatic speech recognition system. The transcripts of the two audio files may then be compared and, if a sufficient portion of the transcripts are similar, the files may be deemed redundant.

Video content generally is a composite of audio and image data. As such, similar redundancy identification techniques as described above with reference to image and audio content may be used to identify redundant video content elements.

Once redundant content is identified, the interface generator 320 may be configured to select one of the redundant content elements for inclusion in the display 400. The selection may be based on a navigation configuration provided to the navigation generator 300. For example, the navigation configuration may include information configuring the interface generator 320 to favor content having certain characteristics (e.g., resolution, dimensions, format, sample rate, length, etc.) over others. In some implementations, the selection may be based on characteristics of the access device 200 that will be receiving the display 400. For example, if the access device 200 has limited resources, it may be desirable to remove a large, resource intensive version of the content in favor of a smaller, more efficient version of the content.
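
The selection among redundant versions might reduce to a scoring function such as the one sketched below; the version structure, the characteristics considered, and the device-limited rule are assumptions made for illustration.

    def pick_content_version(versions, device_limited=False):
        """Choose one version from a set of redundant content elements.

        Each version is a dict such as {"resolution": (w, h), "bytes": ...};
        the structure and scoring are illustrative assumptions.
        """
        def score(version):
            if device_limited:
                # Favor the smaller, more efficient version on constrained devices.
                return -version["bytes"]
            return version["resolution"][0] * version["resolution"][1]
        return max(versions, key=score)

    versions = [
        {"name": "large", "resolution": (4000, 3000), "bytes": 8_000_000},
        {"name": "small", "resolution": (1200, 900), "bytes": 300_000},
    ]
    print(pick_content_version(versions, device_limited=True)["name"])  # -> "small"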

The interface generator 320 may be configured to generate interactive control elements for inclusion in the display 400. The interactive control elements may include hyperlinks, buttons, icons, data input fields (e.g., text field, drop down, file uploader), or the like. The interactive control elements may be associated with content. For example, a preview image may be interactive such that when an interaction is detected with the preview image, the display 400 is adjusted. For example, the display 400 may change the visibility of an indicator for a portion of an item shown in the preview image interacted with.

In some implementations, it may be desirable to generate specific versions of the navigation interface tailored for the access device that will be receiving and presenting the display information. In such implementations, an access device detector 325 may be included. The access device detector 325 is configured to detect access device information such as display dimensions, display orientation, an identifier for an application being used to display the interface, resource level for the access device (e.g., power, network bandwidth, processor bandwidth, memory, etc.), and the like.

In the implementation shown in FIG. 3, the access device detector 325 is implemented as instructions residing in the memory 350 that, when executed, cause the navigation generator 300 to perform the functions described. In some implementations, the access device detector 325 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The access device detector 325 may detect access device information based on a received request for enhanced content. The request may include header information or other metadata providing information about the access device 200, the application of the access device that generated the request, and the resources (e.g., network type) available to the access device 200.
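
For example, header-based detection might look roughly like the sketch below. User-Agent, Save-Data, and Viewport-Width are common HTTP request headers, but the parsing rules and the returned fields are illustrative assumptions rather than the access device detector 325 itself.

    def detect_access_device(headers):
        """Derive coarse access device information from HTTP request headers.

        Illustrative sketch; real detection would be far more robust.
        """
        user_agent = headers.get("User-Agent", "").lower()
        return {
            "is_mobile": any(token in user_agent for token in ("mobile", "android", "iphone")),
            "save_data": headers.get("Save-Data", "").lower() == "on",
            "viewport_width": int(headers.get("Viewport-Width", 0) or 0),
        }

    request_headers = {
        "User-Agent": "Mozilla/5.0 (iPhone; CPU iPhone OS 15_0 like Mac OS X) Mobile/15E148",
        "Save-Data": "on",
        "Viewport-Width": "390",
    }
    print(detect_access_device(request_headers))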

In some implementations, the access device detector 325 may obtain additional information about the access device 200. The additional information may be obtained using information about the access device 200 included in the request such as a device identifier or username. Based on this information, the access device 200 information can be retrieved. In some implementations, the information may be retrieved from the access device 200 based on a request from the access device detector 325 transmitted to the access device 200. The request may include the information elements the access device detector 325 is seeking. In some implementations, this information may be included as part of session initiation (e.g., session initiation protocol session; real time transport protocol session) between the access device 200 and the content enhancement engine 1700.

In some implementations, the access device detector 325 may obtain the access device information from the catalog system 110. For example, where the navigation generator 300 is providing the display 400 to the catalog system 110, which then forwards the enhanced content to the access device, the catalog system 110 may store information about the access device 200 and/or its user. Such information may be provided to the access device detector 325 as part of the request message that initiates the generation of the display 400. In some implementations, the access device detector 325 may request this information from the catalog system 110.

In some implementations, the access device detector 325 may identify a service provider for the access device 200. The service provider may be, for example, an internet service provider or wireless service provider. The service provider may present an interface to service requests for access device information. In some implementations, the access device detector 325 may provide information about the access device (e.g., IP address, device identifier, etc.) in a message requesting additional access device information. In response, the service provider may provide all or some of the requested information.

In some implementations, the service provider may provide information to help control the flow of traffic through its network. As such, the service provider may provide information about the traffic that may be permitted for an access device rather than specific details about a particular access device. This may include the types of content that can be received by the access device 200, limits on the quantity of content that can be received by the access device 200, network speed for the access device 200 (e.g., 3G, 4G, LTE, enhanced LTE, Wi-Fi, etc.), whether the access device 200 is roaming, and the like. This information can be used by the navigation generator 300 to identify what content should be included in the display 400.

Whether the navigation interface is general or custom, the interface generator 320 provides the display 400 as an output. The display 400 may be provided in machine readable format for presentation via the access device 200. In some implementations, the machine readable format may be HTML or another mark-up language. Within the machine readable format, the content to be displayed, the location of where to display the content, interactive control elements associated with one or more of the content elements to be displayed, textual information, audio information, or the like may be identified.

FIG. 4 is a pictorial diagram of an example display generated by the navigation engine of FIG. 3. The display 400 includes a main display zone 420 and a film strip display zone 404. The main display zone 420 is shown presenting an image of an item 422. The display 400 may include information identifying the image of the item 422 as the initial image to show because it provides the full view of the item 422. Within the main display zone 420, four points of interest are identified: 426a, 426b, 426c, and 426d. The points of interest correspond to content showing detailed views of the item at the respective areas. Based on image analysis of the main image and the content showing the detailed views, visual indicia of the points of interest may be included within the main display zone 420. The image analysis may identify what area content showing a detailed view is illustrating by comparing features of the detailed view with the main view. Features may include comparing color, color transitions, color gradient locations, color gradient orientations, edges, or identifiable reference points for the item (e.g., for a print of a work of art, a corner of the print). The visual indicia provide context to allow intuitive browsing of the various content.
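
One way to find which region of the overview image a detailed view corresponds to is multi-scale template matching; the OpenCV-based sketch below illustrates that idea only (the scale range and file names are assumptions) and is not the image analysis actually performed.

    import cv2

    def locate_detail_in_overview(overview_path, detail_path, scales=(0.25, 0.5, 0.75, 1.0)):
        """Find the region of the overview image best matching a detailed view.

        Illustrative multi-scale template matching; returns (score, (x, y, w, h)).
        """
        overview = cv2.imread(overview_path, cv2.IMREAD_GRAYSCALE)
        detail = cv2.imread(detail_path, cv2.IMREAD_GRAYSCALE)
        best = (0.0, None)
        for scale in scales:
            w = int(detail.shape[1] * scale)
            h = int(detail.shape[0] * scale)
            if w < 8 or h < 8 or w > overview.shape[1] or h > overview.shape[0]:
                continue
            resized = cv2.resize(detail, (w, h))
            result = cv2.matchTemplate(overview, resized, cv2.TM_CCOEFF_NORMED)
            _, max_val, _, max_loc = cv2.minMaxLoc(result)
            if max_val > best[0]:
                best = (max_val, (max_loc[0], max_loc[1], w, h))
        return best

    # Hypothetical file names; a score near 1.0 indicates a confident match.
    score, region = locate_detail_in_overview("overview_440.jpg", "detail_406a.jpg")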

For example, if an interaction (e.g., click, tap, selection) with the point of interest 426a is detected, the main display zone 420 may be adjusted to replace an overview image 440 of the item 422 with a detailed view 406a. The adjustment may include animating a transition from the overview image to the detailed view 406a such as by zooming into the detailed view. This may be achieved by interpolation and/or blurring from the initial view to the detailed view 406a. If an additional detailed view exists, the same process may be repeated to zoom in further.

The display 400 may also include a control element to zoom out. In zooming out, the animation may be reversed to adjust the main display zone 420 to return to a broader view of the item 422 showing more features than a detailed view such as the detailed view 406a. In this way, the display 400 provides orienting context to the different content showing the item.

The film strip display zone 404 may include more thumbnails (e.g., preview images) than can be shown concurrently with the main view. In such implementations, a control element 410 may be included to permit adjustment of the thumbnails shown in the film strip display zone 404. As one thumbnail is removed from the film strip display zone 404, the remaining thumbnails are moved up and a new thumbnail is added to the film strip display zone 404. While the film strip display zone 404 shown in FIG. 4 is a vertical film strip display zone 404, it will be appreciated that the film strip display zone 404 may be oriented horizontally or at an angle depending on, for example, access device information such as the orientation of the access device, orientation of the display device, display device shape, etc.

In some implementations, the previews included in the film strip display zone 404 may include a graphical indicium of the type of content previewed. For example, the preview of detailed view 406e includes a playback icon 408. This icon 408 may indicate that the content previewed can be played back (e.g., is audio or video content). In some implementations, an icon or other indicium indicating the content is enhanced content may be included.

Returning to FIG. 3, the navigation generator 300 may further include a power source 380. The power source 380 may include a wired or wireless source of power sufficient to supply operational power to the elements described. In some implementations, the power source 380 may include a battery or other rechargeable source. In some implementations, the power source 380 may include a solar panel array.

The elements of the navigation generator 300 are coupled by a bus 390. The bus 390 is configured to allow the elements to exchange data (such as the messages described herein) and/or power. In some implementations, parallel busses may be included, one for data and one for power.

In some implementations, the navigation generator 300 may include the memory 350. The memory 350 may be implemented as non-transitory computer-readable memory configured to store executable instructions. The memory 350 may include RAM, ROM, and/or other persistent, non-transitory computer readable media. The memory 350 can store an operating system that provides computer program instructions for use by the navigation processor unit 340 or other elements included in the navigation generator 300 in the general administration and operation of the navigation generator 300. The memory 350 can further include computer program instructions and other information for implementing aspects of the present disclosure. In such a configuration, the navigation processor unit 340 may be in data communication with the computer-readable memory storing the executable instructions. The navigation processor unit 340 may be programmed by the executable instructions to implement all or a portion of the features described.

The navigation generator 300 may include or be in data communication with a cache 395. The cache 395 may be a specially architected memory designed to store content received by the navigation generator 300 and enhanced content generated by the navigation generator 300.

The navigation generator 300 may also receive a navigation configuration. The navigation configuration allows the navigation generator 300 to dynamically adjust how item content is processed to generate enhanced navigation interfaces. For example, the navigation configuration may identify layouts for different access device types. The navigation configuration may identify cache management features such as the quantity of data to cache, the duration to cache data, the cache rotation scheme (e.g., first-in-first-out, largest out first, etc.), and the like. Multiple configurations may be specified using the navigation configuration. When a request for a navigation interface is received, the request may include identifying information indicating for whom the navigation interface will be generated. This can allow different companies to offer the enhanced navigation features described using a custom look-and-feel.

FIG. 5 is a flow diagram depicting an illustrative routine executed by the navigation generator for generating content for an item. The process 500 shown in FIG. 5 illustrates how an enhanced navigation interface can be generated for presenting content related to an item. The process 500 shown in FIG. 5 may be implemented in whole or in part by a navigation generator such as discussed in FIGS. 1 and 3 above to generate a display such as those discussed and shown in FIG. 4. All or part of the process 500 shown in FIG. 5 may be implemented by or in conjunction with a content enhancement workstation such as shown in FIG. 1 and discussed above.

The process 500 begins at block 502. At block 505, an item is identified for display. The item may be identified using an item identifier provided in a message received from the access device 200 or the catalog system 110. In some implementations, the message may include a search result received from the catalog system 110. The identification may include parsing the message to extract the item identification information. Examples of item identifiers include Amazon Standard Identification Numbers (ASINs), European Article Numbers (EANs), International Standard Book Numbers (ISBNs), Universal Product Codes (UPCs), or the like. In some implementations, the identification of the item may be based on an image of the item. Image recognition based on the pixel information included in the image of the item may be used to identify the item. For example, a color histogram of the image of the item may be compared to a database of color histograms for previously identified items. By correlating characteristics of a color histogram for a known item with characteristics of a color histogram for an unknown item, the identity of the unknown item may be determined.
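
The color histogram comparison mentioned above could be approximated as follows; the bin count, the use of correlation as the similarity measure, and the toy database of known items are illustrative assumptions.

    import numpy as np
    from PIL import Image

    def color_histogram(path, bins=8):
        """Build a normalized joint RGB histogram for an image (illustrative)."""
        pixels = np.asarray(Image.open(path).convert("RGB")).reshape(-1, 3)
        hist, _ = np.histogramdd(pixels, bins=(bins, bins, bins), range=((0, 256),) * 3)
        return hist.ravel() / hist.sum()

    def best_match(unknown_path, known_histograms):
        """Identify an unknown item by correlating histograms against known items."""
        unknown = color_histogram(unknown_path)
        def correlation(known):
            return float(np.corrcoef(unknown, known)[0, 1])
        return max(known_histograms, key=lambda item_id: correlation(known_histograms[item_id]))

    # Hypothetical database of previously identified items.
    known = {"rug-123": color_histogram("rug_123.jpg"), "shirt-456": color_histogram("shirt_456.jpg")}
    print(best_match("query_photo.jpg", known))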

At block 510, first content demonstrating the item and second content demonstrating the item are obtained. The first content and the second content may, respectively, include one or more of audio, video, text, or other content demonstrating features of the item. The content may be obtained from the item data store 126 of the catalog system 110. Obtaining the content from the item data store 126 may include transmitting a message including information identifying the item and receiving the content or a reference (e.g., URL) to the content. In some implementations, a set of two or more content elements may be obtained for demonstrating the item. The content obtained at block 510 establishes the available content that can be provided via the navigation interface.

At block 515, the first content is identified as the primary view of the item. The identification of content as the primary view may be based on pixel analysis of the available content for the item including image information. The pixel analysis may include object recognition whereby descriptive information about the item (e.g., metadata or a description from the item data store 126) indicates a type for the item. The type may be “shirt” or “book” or “game” or the like. The object recognition may utilize an image and the type to determine how much of the item is shown in the image content. For example, content including a detailed view of an item does not provide as holistic a view of the item as content including an overview of the item. In FIG. 4, the overview image 440 shown in the main display zone 420 shows the complete item while the detailed view 406 a shows only a portion of the item 422. In such instances, the object recognition may identify a shirt in the overview image 440 but not in the detailed view 406 a.

The pixel analysis may include generating region masks. A region mask identifies contiguous regions including pixel information for an item. The mask for the detailed view 406 a would be truncated by the top of the image. Furthermore, the shape of the mask would be irregular and inconsistent with the item type (e.g., shirt). In contrast, the mask formed from the overview image 440 would form a complete shape with space between the mask and the image borders. The presence of space between the mask and each image border, alone or in conjunction with assessment of the shape of the mask, may be used to confirm the overview image 440 as the primary view.

In some implementations, the pixel analysis may include feature detection. The features of an item appearing in each of at least two images may be identified and then aligned. The features may include corners, blobs, Harris corners, difference of Gaussians of corners or Harris corners, or the like. In some implementations, the pixel analysis may include object detection. The object detection may include identifying objects using histogram of oriented gradients or scale invariant feature transforms (SIFT).
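A brief illustrative sketch of such feature detection and matching, assuming OpenCV's SIFT implementation is used (the ratio-test threshold and function names are assumptions, not requirements of this disclosure):

```python
import cv2

def detect_features(image_gray):
    # Detect scale-invariant keypoints (corners, blobs) and compute descriptors.
    sift = cv2.SIFT_create()
    keypoints, descriptors = sift.detectAndCompute(image_gray, None)
    return keypoints, descriptors

def match_features(desc_a, desc_b, ratio=0.75):
    # Align features between two images using a nearest-neighbor ratio test.
    matcher = cv2.BFMatcher(cv2.NORM_L2)
    matches = matcher.knnMatch(desc_a, desc_b, k=2)
    return [m for m, n in matches if m.distance < ratio * n.distance]
```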

In some implementations, more than one image may provide a full view of the item. For example, if a first image shows the front of the shirt and a second image shows the back of the shirt, the quantity of the item shown in each could be deemed equal. In such instances, further analysis of the images may be conducted to distinguish between the possible candidates. One analysis may consider characteristics of the images such as resolution, dimensions, blurriness, color, format, size for storage, etc. The analysis may identify the image having the most detail (e.g., highest resolution, largest dimensions, least blurriness, most color, highest quality format, highest storage size, etc.) as the primary view. In some implementations, an image may be removed from consideration as the primary image if the image has a characteristic that falls below a predetermined threshold. The threshold may be specified in the navigation configuration.

In some implementations, the analysis to identify a primary image from multiple images may include determining an amount of detail shown in each image. The amount of detail may be determined using pixel information in the image. For example, the color changes caused by showing details such as seams, zippers, characters, illustrations, or other ornamental features can be used to determine a level of detail. The changes may be absolute color changes between two or more pixels. The changes may be expressed as a gradient identifying change over a range of pixels (e.g., a linear color gradient for pixels on a line between the lower left corner of an image and the upper right corner of the image). The image including more detail would be identified as the primary image. In some implementations, an image may be removed from consideration as the primary image if the image has a level of detail that falls below a predetermined threshold. The threshold may be specified in the navigation configuration.
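One possible way to score the level of detail, sketched here under the assumption that mean gradient magnitude is an acceptable proxy for detail and that the threshold value comes from the navigation configuration (the numeric default is illustrative):

```python
import cv2
import numpy as np

def detail_score(image_gray):
    # Average gradient magnitude: seams, zippers, and printed characters
    # produce many local color changes and therefore strong gradients.
    gx = cv2.Sobel(image_gray, cv2.CV_32F, 1, 0)
    gy = cv2.Sobel(image_gray, cv2.CV_32F, 0, 1)
    return float(np.mean(np.sqrt(gx * gx + gy * gy)))

def select_primary(images_gray, min_detail=5.0):
    # Drop candidates below the configured threshold, then keep the most detailed.
    scored = [(detail_score(img), img) for img in images_gray]
    eligible = [(score, img) for score, img in scored if score >= min_detail]
    if not eligible:
        return None
    return max(eligible, key=lambda pair: pair[0])[1]
```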

In some implementations, the identification of an image as the primary view may be performed via the content enhancement workstation 170. In such implementations, the first image and the second image may be received by the content enhancement workstation 170. The content enhancement workstation 170 may obtain the images, such as described with reference to block 510. The content enhancement workstation 170 may generate and present an interface for receiving a selection of an image as the primary view. Upon detecting the selection, such as via an input device coupled with the content enhancement workstation 170, the selection may be transmitted to the content enhancement engine 1700 for further processing.

At block 520, the second content is mapped to a location or region in the first content. Because the first content has been identified as the primary image, the remaining content received may be presented in the navigation interface with reference to the primary image. For example, as shown in FIG. 4, the main display zone 420 includes the overview image 440 showing the item 422. Additional content showing detailed views (e.g., those in the film strip zone 404) can be mapped to locations within the overview image 440 shown in the main display zone 420. The mapping may include pixel analysis of the second content and identifying a location in the first content with an organization of pixels corresponding to the second content. The correlation may include scaling one or both of the first or second content for analysis.
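A minimal sketch of one way such a mapping could be computed, assuming normalized cross-correlation template matching over a few candidate scales (the scale set and function names are illustrative):

```python
import cv2

def map_detail_to_overview(overview_gray, detail_gray, scales=(0.25, 0.5, 0.75, 1.0)):
    # Find the region of the overview image whose pixels best correspond to the
    # detailed view, trying several scales of the detailed view.
    best_region, best_score = None, -1.0
    for scale in scales:
        resized = cv2.resize(detail_gray, None, fx=scale, fy=scale)
        h, w = resized.shape
        if h > overview_gray.shape[0] or w > overview_gray.shape[1]:
            continue  # template must fit inside the overview image
        result = cv2.matchTemplate(overview_gray, resized, cv2.TM_CCOEFF_NORMED)
        _, max_val, _, max_loc = cv2.minMaxLoc(result)
        if max_val > best_score:
            best_region, best_score = (max_loc[0], max_loc[1], w, h), max_val
    # Returns the upper-left corner and size of the mapped region plus its score.
    return best_region, best_score
```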

The pixel analysis may include feature detection. The features of an item appearing in each of at least two images may be identified and then aligned. The features may include corners, blobs, Harris corners, difference of Gaussians of corners or Harris corners, or the like. In some implementations, the pixel analysis may include object detection. The object detection may include identifying objects using histogram of oriented gradients or scale invariant feature transforms (SIFT).

In some implementations, the mapping of block 520 may be performed via the content enhancement workstation 170. The content enhancement workstation 170 may generate and present an interface for receiving a selection of an area of the first image showing the portion of the item depicted in the second image. For example, the interface may display the first image and receive a bounding box selection identifying the area of the first image showing the portion of the item depicted in the second image.

In some implementations, the content enhancement workstation 170 may receive mappings to locations detected via pixel analysis. In such implementations, the content enhancement workstation 170 may be used to confirm the machine generated mapping. If an adjustment to the mapping is received from the content enhancement workstation 170, the adjustment information may be used to refine the pixel analysis for subsequent images. The refinement may include retraining a model for mapping images.

The mapping may be performed relative to the primary view. In such implementations, the location information may be provided with reference to a location within the first content. In some implementations, content may be mapped to multiple views. For example, in addition to the detailed view 406 b of a pocket, content showing a further detailed view of the pattern appearing on the pocket may be provided. In such implementations, the mapping may be repeated for the content including the view of the pattern.

The mapping may be represented in a hierarchy starting with the primary view as the root node of the hierarchy. From the root node, each content element to be included in the display 400 may be represented as a child node. The child nodes may include the location information indicating the portion of the parent (e.g., root node) shown in the content. In cases where a further detailed view of a child node is available, another child node may be associated with the initial detailed view node and include similar location information. In implementations including the content enhancement workstation 170, an interface for editing the hierarchy may be provided to generate the mapping. The interface may allow selection of a node within the hierarchy and moving the selected node to a location within the hierarchy to indicate the relationship between content associated with the node and at least one other node in the hierarchy.

FIG. 6 is a pictorial diagram of a hierarchy mapping generated by the navigation engine associating content elements to locations in an overview image. The hierarchy 600 includes content shown in FIG. 4 beginning at a root node with the overview image 440. From there, detailed views 406 a, 406 b, 406 c, and 406 d are associated with x, y locations within the overview image 440. The x, y locations identify an upper-left pixel within the overview image 440 of an area shown by the associated detailed view. The x, y location information may be provided as coordinates, percentages, or other information sufficient to locate an image within the image associated with its parent node. Detailed view 406 f shows how a child node of the overview image 440 can also be associated with an additional detailed view. In FIG. 6, the detailed view 406 f is associated with location information for its immediate parent node (e.g., the detailed view 406 b). In some implementations, it may be desirable to include location information for the detailed view 406 f for all nodes along a path to the root node. In this way, the detailed view 406 f may be accurately associated with content when displayed via the display 400. The hierarchy 600 may be used to identify a relationship between multiple content items for a location. In some implementations, the nodes included in the hierarchy 600 may be associated with interactions. For example, where two content elements are included at a location, a first interaction may cause presentation of the first element while a second interaction may cause presentation of the second element.
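An illustrative sketch of such a hierarchy as a simple tree structure (the class names, field names, and use of percentage coordinates are assumptions made for this example):

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class ContentNode:
    # One node of the navigation hierarchy: the root holds the overview image,
    # children hold detailed views mapped to locations within their parent.
    content_id: str
    x_pct: Optional[float] = None   # upper-left corner of the mapped area,
    y_pct: Optional[float] = None   # as a percentage of the parent image
    interaction: Optional[str] = None  # e.g., "click" or "double-click"
    children: List["ContentNode"] = field(default_factory=list)

    def add_child(self, child: "ContentNode") -> "ContentNode":
        self.children.append(child)
        return child

# Example mirroring FIG. 6: detailed views hang off the overview image,
# and a further detailed view hangs off detailed view 406b.
root = ContentNode("overview_440")
pocket = root.add_child(ContentNode("detail_406b", x_pct=40.0, y_pct=55.0, interaction="click"))
pocket.add_child(ContentNode("detail_406f", x_pct=10.0, y_pct=20.0, interaction="click"))
```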

Returning to FIG. 5, at block 525, presentation of the first content and an identifier at the location of the second content is caused. In some implementations, this may include transmitting a message including the presentation information to the access device 200. The presentation may include instructions to launch an application on the access device 200 for rendering the navigation interface on the display 400. The identifier for the location shown in the second content may be a box or other visible indicium. In some implementations, a veil may be presented over the location shown by the second content. The presentation of identifiers may be toggled using a control element also presented on the display 400. When activated, the visible indicia are shown as overlays to the primary view. When deactivated, the indicia may be hidden. In some implementations, a preview of content showing the detailed view of the item (e.g., the second content) may be presented via the display 400. In such implementations, if an interaction such as a mouse over event is detected with the preview of the content showing the detailed view of the item, the visible indicium for the location shown in the content showing the detailed view interacted with may be activated within the main display zone 420.

The location may also be associated with an interaction. In some implementations, the interaction may be a select or click event. At block 530, an interaction with the location is detected. An interaction may include location information indicating where the interaction was detected. An interaction may include an interaction type such as a click, double-click, identification of which button was clicked, a tap, a gesture, etc. An interaction may also include a direction.

Once the interaction is detected (e.g., determined to be within the location and/or of a predetermined type), at block 535, the second content is presented in place of the first content. As discussed above, the presentation may include animating motion from content showing an overview of an item to the selected content showing a detailed view of the item. This helps orient the user to where the navigation is taking them. In cases where the second content is a video or audio content type, upon presentation in the main display zone 420, playback may be automatically initiated. At block 590, the process ends.

The display 400 may include hints indicating the availability of content to facilitate navigation of different content for an item as discussed above. Once content is presented, additional hinting may be displayed, such as a hint indicating how to interact with enhanced content.

FIG. 7 is a block diagram depicting illustrative components of an embodiment of a content hint generator implemented by the content enhancement engine shown in FIG. 1. The content hint generator 700 shown in FIG. 7 is a simplified example in which extraneous elements have been omitted to focus the reader on specific features discussed. Elements may be added, removed, or modified without departing from the intended scope of what constitutes a content hint generator.

A hinting processor unit 740 may be included to coordinate the functions of the content hint generator 700. For example, the hinting processor unit 740 may be configured to coordinate the process of receiving requests for hints for enhanced content and generating one or more hints for the enhanced content.

The content hint generator 700 shown in FIG. 7 includes a network input/output interface (“network I/O”) 735. The network I/O 735 may be configured to transmit and receive communications via a network such as the Internet, a local area network, a wide area network, a peer-to-peer network, a cellular network, a satellite network, a public switched telephone network, any combination thereof, or other communication system for exchanging data between electronic devices. In some implementations, the network I/O 735 is configured to receive content (or identifiers thereof) and requests for hints for content, transmit hints for content, and communicate request and response messages for access device information that can be used to generate hints for content, as will be described in further detail below.

Hints may be graphical, audio, textual, haptic, or other detectable indicia produced by the access device to suggest the interaction available for enhanced content. The hint may be generated so as to invite user interaction with the enhanced content. The hint may provide an indication of the type of interaction needed to activate the enhanced content. For example, if the enhanced content includes perspective views of an item to correspond to the angle of the access device, icons may be included in the hint suggesting movement of the access device. The hint may provide an indication of a location and a direction of interaction needed to activate the enhanced content. For example, if enhanced content demonstrating unzipping of a zipper is available for an item, the hint may provide a visual indicium near the zipper along with an arrow pointing in the direction the zipper can be unzipped. In some implementations, the hint may include a preview of what the enhanced content will show when there is interaction with the hint. In the zipper example, the arrow may be animated to move along the zipping direction. In some implementations, the movement may be accompanied with ghost or phantom lines indicating how the item will be shown during the interaction. In some implementations, a portion of the enhanced content may be displayed providing, via the hint, a preview of the enhanced content.

FIG. 8 is a pictorial diagram showing an example of a content hint generated by the content hint generator and displayed in conjunction with content for an item. The content may be enhanced content showing a sweatshirt 802. The enhancement may be to simulate the opening of the top of the sweatshirt 802 to allow a user to see the inside of the sweatshirt 802. For such enhanced content, it may be desirable to provide a hint 804 that suggests with which part of the sweatshirt 802 a user can interact and what kind of interaction will activate the enhancement. The hint 804 is shown as a graphic overlay on an image of the sweatshirt 802.

Returning to FIG. 7, to generate a hint, such as the hint 804, the content may be analyzed by a content analyzer 715. In the implementation shown in FIG. 7, the content analyzer 715 is implemented as instructions residing in a memory 750 that, when executed, cause the content hint generator 700 to perform the functions described. In some implementations, the content analyzer 715 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The content analyzer 715 may be configured to analyze attributes of the content to identify the appropriate hint for the content. The attributes of the content that may be analyzed include metadata for the content, content type (e.g., image content, video content, textual content, audio content, enhanced content) of the content, pixel and/or audio data included in the content, the item shown in the content, or a type (e.g., clothing, book, hardware, footwear, etc.) for the item shown in the content.

In some implementations, the metadata included with the content may indicate the enhancements for the content such as interactive areas, type of interaction, etc. This metadata about the enhancements may be extracted from the content by the content analyzer 715 and provided to a hinting information generator 720.

In some implementations, the content analyzer 715 may detect the type of the content. The content analyzer 715 may detect the type of the content using a file name of the content whereby a naming convention may be used to infer the content type for content. A content hint configuration may be provided to the content hint generator 700. The content hint configuration may include a mapping of file type information to content types. The content analyzer 715 may use this configuration information to infer the content type using the file name of the content. Similar configuration information may be used to infer content type from other attributes for the file such as author, size, location, and the like.
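A small sketch of that inference, assuming a content hint configuration that maps file extensions to content types (the extensions and type labels shown are illustrative):

```python
from pathlib import Path

# Illustrative content hint configuration: file extension -> content type.
HINT_CONFIG = {
    ".jpg": "image", ".png": "image",
    ".mp4": "video", ".webm": "video",
    ".mp3": "audio", ".txt": "text",
    ".enh": "enhanced",  # hypothetical extension for enhanced content
}

def infer_content_type(file_name: str, config=HINT_CONFIG) -> str:
    # Infer the content type from the file name using the configured mapping;
    # fall back to "unknown" so callers can apply further analysis.
    return config.get(Path(file_name).suffix.lower(), "unknown")

assert infer_content_type("sweatshirt_collar.mp4") == "video"
```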

The content analyzer 715 may analyze information included in the content. For example, animated content may include certain sequences of bytes within the file containing the content. These sequences may be detected by the content analyzer 715 and used to categorize the content and/or determine the enhancements thereto. The content analyzer 715 may be configured to analyze the information encoded by the content. For example, if the content includes a series of images, the content analyzer 715 may be configured to identify the ways the images differ over the series to identify a location within the content that changes and/or how the image shown at the location changes over the series. The information about where and how the content changes can be provided to the hinting information generator 720 to guide the placement of the hint for the content to the location where the content is changing.

The hinting information generator 720 shown in FIG. 7 is implemented as instructions residing in the memory 750 that, when executed, cause the content hint generator 700 to perform the functions described. In some implementations, the hinting information generator 720 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

In some implementations, the hinting information generator 720 may generate the hint using information received or detected from the access device that will be presenting the hint. For example, it may be desirable to provide hints that can be presented via the access device. Consider, for example, an access device that does not include a haptic output device. For such access devices, it may be desirable to exclude haptic hinting because such a hint would be ignored by the access device. Consider, as another example, an access device that is accessing the content via a low bandwidth network and, perhaps, while roaming. For such access devices, it may be desirable to keep the quantity of data provided to the access device as low as possible. In such instances, it may be desirable to generate resource efficient hints or omit them altogether. When considering access device information, the content hint configuration may include threshold values for one or more access device resource levels and/or types. By comparing a threshold value to the detected or received value for the access device, the hinting information generator 720 can dynamically provide the hint tailored for the access device.
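As a sketch of that comparison, assuming a configuration keyed by hint type with per-resource thresholds (the resource names, threshold values, and hint types are illustrative assumptions):

```python
# Illustrative content hint configuration: minimum resource levels per hint type.
HINT_THRESHOLDS = {
    "haptic": {"has_haptics": True},
    "video_preview": {"bandwidth_kbps": 2000},
    "animated_overlay": {"bandwidth_kbps": 500},
    "static_icon": {},  # no requirements; always allowed
}

def select_hint_types(device_info: dict) -> list:
    # Keep only the hint types whose thresholds the access device satisfies.
    allowed = []
    for hint_type, requirements in HINT_THRESHOLDS.items():
        ok = True
        for key, required in requirements.items():
            value = device_info.get(key)
            if isinstance(required, bool):
                ok = ok and value is required
            else:
                ok = ok and value is not None and value >= required
        if ok:
            allowed.append(hint_type)
    return allowed

# A phone on a slow, roaming connection only qualifies for lightweight hints.
print(select_hint_types({"has_haptics": True, "bandwidth_kbps": 300, "roaming": True}))
```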

The hinting information generator 720 may also tailor the hint to the specific interactions that are detectable by an access device. For example, if the access device is a smartphone, the smartphone may support detection of touch events on a display. However, if the access device is a desktop computer, touch events may not be supported. In each case, how the interaction is detected (e.g., touch versus mouse event) may differ. These differences may lead to different hints depending on the access device that will be rendering the hint. The differences may be in the actual hint presented (e.g., graphics, sound, or haptics) and/or in how the hint is activated.

The access device specific hinting may be omitted in some implementations. In some implementations, a default hint tailoring may be provided. For example, where the hinting information generator 720 generates hints for access devices associated with a specific catalog system, the catalog system may track common access devices for its users. This information can be used to generate an average or default access device profile including the characteristics most commonly associated with the access devices used for the catalog system.

An access device detector 725 may be included to receive and/or detect the access device information. Similar to the access device detector 325 shown in FIG. 3, the access device detector 725 may be configured to detect access device information such as display dimensions, display orientation, an identifier for an application being used to display the interface, resource level for the access device (e.g., power, network bandwidth, processor bandwidth, memory, etc.), and the like.

The access device detector 725 shown in FIG. 7 is implemented as instructions residing in the memory 750 that, when executed, cause the content hint generator 700 to perform the functions described. In some implementations, the access device detector 725 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The access device detector 725 may detect access device information based on a received request for a hint. The request may include header information or other metadata providing information about the access device 200, the application of the access device that generated the request, and resources (e.g., network type) available to the access device 200.

In some implementations, the access device detector 725 may obtain additional information about the access device 200. The additional information may be obtained using information about the access device 200 included in the request such as a device identifier or username. Based on this information, the access device 200 information can be retrieved. In some implementations, the information may be retrieved from the access device 200 based on a request from the access device detector 725 transmitted to the access device 200. The request may include the information elements the access device detector 725 is seeking. In some implementations, this information may be included as part of session initiation (e.g., session initiation protocol session; real time transport protocol session) between the access device 200 and the content enhancement engine 1700.

In some implementations, the access device detector 725 may obtain the access device information from the catalog system 110. For example, the catalog system 110 may request hints and include them in a response to the access device 200. In such implementations, the catalog system 110 may store information about the access device 200 and/or its user. Such information may be provided to the access device detector 725 as part of the request message that initiates the generation of the hint. In some implementations, the access device detector 725 may request this information from the catalog system 110.

In some implementations, the access device detector 725 may identify a service provider for the access device 200. The service provider may be, for example, an internet service provider or wireless service provider. The service provider may present an interface to service requests for access device information. In some implementations, the access device detector 725 may provide information about the access device (e.g., IP address, device identifier, etc.) in a message requesting additional access device information. In response, the service provider may provide all or some of the requested information.

In some implementations, the service provider may provide information to help control the flow of traffic through its network. As such, the service provider may provide information about the traffic that may be permitted for an access device rather than specific details about a particular access device. This may include the types of content that can be received by the access device 200, limits on the quantity of content that can be received by the access device 200, network speed for the access device 200 (e.g., 3G, 4G, LTE, enhanced LTE, Wi-Fi, etc.), whether the access device 200 is roaming, and the like. This information can be used by the content hint generator 700 to identify what types and quantities of hint information may be provided to the access device 200.

The content hint generator 700 may further include a power source 780. The power source 780 may include a wired or wireless source of power sufficient to supply operational power to the elements described. In some implementations, the power source 780 may include a battery or other rechargeable source. In some implementations, the power source 780 may include a solar panel array.

The elements of the content hint generator 700 are coupled by a bus 790. The bus 790 is configured to allow the elements to exchange data (such as the messages described herein) and/or power. In some implementations, parallel busses may be included, one for data and one for power.

In some implementations, the content hint generator 700 may include the memory 750. The memory 750 may be implemented as non-transitory computer-readable memory configured to store executable instructions. The memory 750 may include RAM, ROM, and/or other persistent, non-transitory computer readable media. The memory 750 can store an operating system that provides computer program instructions for use by a hinting processor unit 740 or other elements included in the content hint generator 700 in the general administration and operation of the content hint generator 700. The memory 750 can further include computer program instructions and other information for implementing aspects of the present disclosure. In such a configuration, the hinting processor unit 740 may be in data communication with the computer-readable memory storing the executable instructions. The hinting processor unit 740 may be programmed by the executable instructions to implement all or a portion of the features described.

The content hint generator 700 may also receive a content hint configuration. The content hint configuration allows the content hint generator 700 to dynamically adjust how content is processed to generate hints. For example, the content hint configuration may identify hint types to use for different access device types. Multiple configurations may be specified using the content hint configuration. When a request for a hint is received, the request may include identifying information indicating the party for which the hint will be generated. This can allow different companies to offer the hint features described using a custom look-and-feel (e.g., iconography, font, color, etc.).

FIG. 9 is a flow diagram depicting an illustrative routine executed by the content hint generator for generating an enhanced content hint. The process 900 shown in FIG. 9 illustrates how a hint can be generated for content related to an item. The process 900 shown in FIG. 9 may be implemented in whole or in part by a content hint generator such as discussed in FIGS. 1 and 7 above to generate a hint such as those discussed and shown in FIG. 8. Some or all of the process 900 shown in FIG. 9 may be implemented by or in conjunction with a content enhancement workstation such as shown in FIG. 1 and discussed above.

The process 900 begins at block 902. At block 905, a set of content, each element showing at least a portion of an item, is received. The set of content may include enhanced content, image content, audio content, video content, textual content, etc. The content, or references allowing retrieval of the content included in the set, may be received. In cases where the set of content includes enhanced content, it may be desirable to provide a hint to indicate where and/or how the enhanced content can be interacted with.

At block 910, image content included in the set of content is identified to serve as the reference point for the hinting. The process 900 shown and described in FIG. 9 uses the image content included in the set of content to provide the hint for enhanced content included in the same set of content. In some implementations, the identification of the image content at block 910 may include generating image content showing the item that is also shown in the enhanced content. In such implementations, the hint may be provided with reference to the generated image.

In some implementations, the identification of the image content may be performed via the content enhancement workstation 170. In such implementations, the set of content may be received by the content enhancement workstation 170. The content enhancement workstation 170 may generate and present an interface displaying at least a portion of the set of content. The interface may include control elements (e.g., buttons, links, or data entry fields) for receiving a selection of image content to serve as the reference point for hinting. Upon receiving the selection, such as via an input device coupled with the content enhancement workstation 170, the selection may be transmitted to the content enhancement engine 1700 for further processing.

At block 915, enhanced content is identified that shows a portion of the item shown in the image content from block 910. In some implementations, the enhanced content may be identified before identifying the image content. In such instances, the image content may be identified based on the enhanced content. For example, the image content may be identified by comparing the portion of the item shown in the enhanced content to a quantity of the portion of the item shown in other content included in the set of content. The image content may be identified as the content showing the highest quantity of the portion of the item shown in the enhanced content. This can include a proportionality test whereby the content showing the highest proportion of the portion of the item is selected. For example, where a first image shows an overview of a shirt and a second image shows the collar of the shirt, when selecting an image to include hinting for enhanced content showing the collar, both the first and the second image may show the entire collar. However, because the second image is a detailed image of the collar, more of its image content is devoted to the collar. Accordingly, the second image may be identified as the content for conveying the hint.

In some implementations, the identification of the enhanced content may be performed via the content enhancement workstation 170. In such implementations, several elements of enhanced content may be received by the content enhancement workstation 170. In some implementations, the content enhancement workstation 170 may receive an identifier for the item and, using the identifier, obtain the available enhanced content for the item from the catalog system 110 or other remote content repository. The content enhancement workstation 170 may generate and present an interface displaying at least some of the obtained enhanced content. The interface may include control elements (e.g., buttons, links, or data entry fields) for receiving a selection of the enhanced content showing a portion of the item of interest (e.g., shown in the image content from block 910). Upon receiving the selection, such as via an input device coupled with the content enhancement workstation 170, the selection may be transmitted to the content enhancement engine 1700 for further processing.

At block 920, the enhanced content is mapped to a location in the image content. The mapping may include mapping the area of interactivity from the enhanced content to an area in the image content.

FIG. 10 is a pictorial diagram showing how a location to place an enhanced content hint within content may be detected by the content hint generator by detecting an object shown in the enhanced content. The enhanced content 1002 may include video content and associated interaction mappings to demonstrate the opening of the collar of the item shown in the enhanced content 1002. The enhanced content 1002 may include a series of images each showing different states for the collar. Three images that may be included in the series of images (e.g., images 1004, 1006, and 1008) are also shown. In the example shown in FIG. 10, image content 1050 is identified for presenting the hint. Because the item is shown the same way in both the enhanced content and the image content, the mapping of the location of the interaction may include identifying the location of the interaction in the enhanced content. Since the images show the item in the same way, the location should correspond to a similar location on the image content 1050. For example, if the location of the interaction for the enhanced content is centered at x, y pixel location 150, 150 in the enhanced content 1002, the location for placing the hint on the image content 1050 would be 150, 150.

In some implementations, the image content 1050 may not show the same view as presented in the enhanced content 1002. In such implementations, the location may be translated by scaling, interpolating, or other image processing to align the image content 1050 with the portion of the item shown in the enhanced content 1002.

To identify the location of the interaction, the enhanced content may be analyzed, such as by a content analyzer (e.g., the content analyzer 715 shown in FIG. 7). As shown in FIG. 10, image analysis of the first image 1004 in the series and the last image 1008 in the series may be performed. The image analysis may identify the differences between these two images (e.g., 1004 and 1008). For the images shown in FIG. 10, the location of pixels that changed from the first to the last image may be identified as a location 1055 for the hint. The difference may be compared to a threshold such that minor differences due to encoding, noise, or other non-substantive variables are ignored in favor of more substantial changes in how the item is shown in the images. The threshold may be specified via the content hint configuration.
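A compact sketch of that difference analysis, assuming the changed-region centroid is used as the hint location and that the threshold comes from the content hint configuration (the default value is illustrative):

```python
import cv2
import numpy as np

def hint_location(first_frame_gray, last_frame_gray, threshold=25):
    # Pixels that differ substantially between the first and last frames mark
    # where the enhanced content changes; small differences (noise, encoding
    # artifacts) fall below the threshold and are ignored.
    diff = cv2.absdiff(first_frame_gray, last_frame_gray)
    changed = diff > threshold
    if not changed.any():
        return None
    ys, xs = np.nonzero(changed)
    # Use the centroid of the changed region as the hint placement location.
    return int(xs.mean()), int(ys.mean())
```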

As discussed above, a hint may provide the location of the interaction as well as a direction for the interaction to activate enhanced content. As part of the mapping at block 920, the direction of interaction may be mapped.

FIG. 11 is a pictorial diagram showing how an interaction direction indicated by an enhanced content hint may be detected by the content hint generator as part of generating an enhanced content hint for enhanced content using optical flow of pixels within the enhanced content. The series of images included in the enhanced content 1002 may be analyzed to determine not only where the images change through the series, but also a direction in which the images change. Optical flow is one image analysis that may be used to detect motion between images in the series of images. In some implementations, feature flow may be used to detect motion. Where optical flow identifies movement of pixels, feature flow identifies movement of image features. Feature flow motion detection may include identifying a feature shown in an image and tracking the feature between images in the series of images. In some implementations, the features used for feature flow detection may be identified using an area of interest. The area of interest may be preconfigured (e.g., the center of the first image in the series of images), or provided as annotation information for an image. For example, the content enhancement workstation 170 may present the image via an interface. The interface may receive an indication of a feature to use for feature flow motion detection for the image.

Feature or optical flow detection may include identifying motion vectors between two images. The vectors may be represented on a map of the image relative to the location where the motion is detected. A first flow map 1102 showing the motion detected between images 1004 and 1006 is shown in FIG. 11. A second flow map 1104 showing the motion detected between images 1006 and 1008 is also shown in FIG. 11. One or more flow maps may be used to detect the location and/or direction of movement within the enhanced content. Although FIG. 11 shows maps for images in sequence, it may be desirable to map motion between the first image and the last image in the series. While this map may not provide incremental details between intervening images, using only the first and the last images can provide a computational efficiency that may be desirable in resource constrained operational environments.
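A sketch of computing such a flow map and a single interaction direction, assuming dense optical flow (Farnebäck's method) is acceptable; the parameter values and the motion-magnitude cutoff are illustrative:

```python
import cv2
import numpy as np

def flow_map(prev_gray, next_gray):
    # Dense optical flow: a per-pixel (dx, dy) motion vector between two frames.
    return cv2.calcOpticalFlowFarneback(prev_gray, next_gray, None,
                                        pyr_scale=0.5, levels=3, winsize=15,
                                        iterations=3, poly_n=5, poly_sigma=1.2,
                                        flags=0)

def dominant_direction(flow):
    # Average the vectors where meaningful motion occurred to get a single
    # interaction direction (e.g., the direction a zipper is pulled).
    magnitude = np.linalg.norm(flow, axis=2)
    moving = magnitude > 1.0
    if not moving.any():
        return None
    mean_dx = float(flow[..., 0][moving].mean())
    mean_dy = float(flow[..., 1][moving].mean())
    return mean_dx, mean_dy
```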

In some implementations, information included in the enhanced content may be captured with an optical flow sensor. For example, a camera may capture images for inclusion in the enhanced content while an optical flow sensor (e.g., an optical flow camera) captures motion information within the same field of view as the camera. The motion information obtained by the optical flow sensor may be combined with the images captured by the camera to form part of the enhanced content. In such implementations where motion information is included in the enhanced content, the mapping may use the motion information included in the enhanced content.

In some implementations, the mapping between the enhanced content and the image content may be performed via the content enhancement workstation 170. In such implementations, the enhanced content and the image content may be received by the content enhancement workstation 170. The content enhancement workstation 170 may generate and present an interface presenting the enhanced content and the image content. The interface may include control elements (e.g., buttons, links, or data entry fields) for receiving the mapping. For example, the image content may be dragged, resized, rotated, or otherwise manipulated to overlay a portion of the enhanced content. The mapping may include storing the manipulations to the image content to indicate the relationship between the image content and the enhanced content. Upon receiving the mapping, such as via an input device coupled with the content enhancement workstation 170, the mapping may be transmitted to the content enhancement engine 1700 for further processing.

In some implementations, the content enhancement workstation 170 may receive a mapping between the enhanced content and image content generated through analysis and comparison of the enhanced content and the image content. In such implementations, the content enhancement workstation 170 may be used to confirm the machine generated mapping. If an adjustment to the mapping is received from the content enhancement workstation 170, the adjustment information may be used to refine the analysis and/or comparison used for mapping images and enhanced content. The refinement may include retraining a model for mapping content.

Returning to FIG. 9, having identified the location of the interaction and, in some implementations, a direction of interaction, at block 925, a determination is made as to whether access device information is available. The access device information may be used to further tailor the hint to be presented via the image content. As discussed above, the capability of the access device or intended user of the access device can dictate the types and/or format of the hinting provided.

If no access device information is available, or if access device specific hinting is disabled (e.g., via the content hint configuration), at block 929, a default hint capability profile is obtained. The default hint capability profile may identify the standard hint type and format to use for generating hints. In some implementations, the standard hint type may include instructions for specifying alternative hints (e.g., conditional hinting such that if the access device has haptics, then provide the haptic hint). Returning to decision block 925, if access device information is available, at block 927, an access device specific hint capability profile is obtained. The access device specific hint capability profile may include the specific hint type and the format for specifying hints (e.g., XML stylesheet, graphics commands, image format, HTML version supported, etc.).

At block 930, a content hint is generated. The content hint generated indicates the availability of the enhanced content and how the enhanced content may be accessed from the image content. Generating the content hint may include generating an animation demonstrating what portion of the item shown in the image content is enhanced and, in some instances, where the portion will move in the enhanced content. The animations may provide a “ghost” view of the enhanced content. For example, if the enhanced content includes video, a number of frames may be sampled from the video and used to provide the ghost view. The number of frames may be selected from the start of the video or sampled at intervals from throughout the video. In some implementations, the enhanced content may be associated with metadata including content hint information. For example, the metadata may include an action field identifying an action (e.g., opens, moves, or changes state) shown in the enhanced content. The action field information may be used to generate the hint such as by mapping actions to visual indicators for the action (e.g., metadata identifying a move action may be mapped to an arrow graphic). In some implementations, the hint may include playing a portion of the video such as via an alpha-channel image overlaid on the image content.
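A sketch of sampling frames for such a ghost view, assuming the enhanced content is a video file readable by OpenCV (the frame count and even-interval sampling are illustrative choices):

```python
import cv2

def sample_ghost_frames(video_path, num_frames=5):
    # Pull a handful of frames at even intervals through the video; these can be
    # rendered as semi-transparent overlays to preview the enhanced content.
    capture = cv2.VideoCapture(video_path)
    total = int(capture.get(cv2.CAP_PROP_FRAME_COUNT))
    frames = []
    for i in range(num_frames):
        position = i * max(total - 1, 1) // max(num_frames - 1, 1)
        capture.set(cv2.CAP_PROP_POS_FRAMES, position)
        ok, frame = capture.read()
        if ok:
            frames.append(frame)
    capture.release()
    return frames
```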

Generating the content hint may also include generating conditions upon which the hint will be displayed. It may be desirable to hide hints from users who are familiar with the system and/or content. Accordingly, one or more conditions may be generated which, upon detection by the access device, will cause a change in the presentation (e.g., display) state of the indicium associated with the hint. For example, a graphical indicium for a hint may be hidden when the content associated with the hint is initially displayed. If no interaction at the location associated with a hint is detected after a predetermined period of time (e.g., 5 seconds, 10 seconds, 2 seconds), the graphical indicium for the hint may be displayed. Similarly, if an interaction is detected with a portion of the content that is not associated with the hint, the graphical indicium for the hint may be hidden to liberate display resources for rendering the area being interacted with rather than the hint area.

In some implementations, generating the content hint may include hinting feedback. For example, if a user is presented with the content hint but is not performing the desired interaction to trigger the enhanced content, it may be desirable to provide hinting feedback. The hinting feedback may include presenting an adjustment from the interaction detected to the desired interaction. For example, if the user is swiping at a location too low, the hinting feedback may display an animated line from the detected swipe location to the desired interaction location. In some implementations, it may be desirable to provide hinting feedback if the user correctly performed the interaction indicated by the content hint. Such feedback may include a haptic response, playback of audio, or presentation of an image.

At block 935, the content hint is provided for presentation in conjunction with the image content. The content hint may include configuration values for the access device to accurately present the hint with the image content. For example, the content hint may specify the location of the hint and the graphic form for the hint (e.g., image file, audio file, haptic feedback, icon code, character, etc.). The content hint may also include configuration values for the access device to accurately detect an interaction with the hint for triggering the enhanced content. For example, the content hint may specify a type of interaction (e.g., tap selection, shake, tilt, drag, gesture, eye movement, etc.), direction of interaction, or magnitude of interaction to activate the enhanced content.

One way content may be enhanced is by generating a physics model for the content to introduce interactivity for the content that simulates the “physics,” e.g., physical processes, properties, manipulation, or phenomena, of or applied to the item shown in the content. For ease of reference, any one or combination of the foregoing may be referred to herein as a “physics property.” For example, when the content includes video content, a physics model may be generated to associate playback locations in the video content with interactions with the content, which interactions simulate physical manipulation of the item shown in the content. For example, video content of an item recorded at a constant height and distance from the item may show the item from all 360 degrees. Playback of the video content may be linear, that is, started and stopped with sequential playback of the content. However, physical manipulation of the item can be simulated by changing properties of the playback such as the speed, location, or scale of the images shown. The changes may be dynamic and in response to interactions detected with the access device displaying the content. For example, if a user slides her finger to the left while the 360 degree video of the item is being displayed so that the item appears to be rotating to the right, playback of the video content may be reversed so as to simulate spinning the item to the left, with the speed of the playback based on the magnitude of the swipe. For example, if the detected interaction is a slow, one centimeter swipe to the left, the video content may be displayed more slowly so as to simulate a slow spin. In contrast, if a fast swipe to the left across the entire screen is detected, the video content may be displayed more quickly so as to simulate more rapid spinning of the item.
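A minimal sketch of mapping a detected swipe to playback of 360-degree video content (the frames-per-centimeter constant and the returned values are illustrative assumptions):

```python
def playback_from_swipe(swipe_dx_cm, swipe_duration_s, current_frame, total_frames,
                        frames_per_cm=12):
    # Map swipe direction to playback direction and swipe speed to playback speed:
    # a leftward swipe plays the 360-degree video in reverse (spinning the item
    # left); a faster or longer swipe advances more frames per second.
    frames_to_move = int(abs(swipe_dx_cm) * frames_per_cm)
    direction = -1 if swipe_dx_cm < 0 else 1
    target_frame = (current_frame + direction * frames_to_move) % total_frames
    speed_fps = frames_to_move / max(swipe_duration_s, 0.05)
    return target_frame, direction, speed_fps

# A slow one-centimeter swipe to the left produces a slow reverse spin.
print(playback_from_swipe(-1.0, 0.8, current_frame=40, total_frames=360))
```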

FIG. 12 is a block diagram depicting illustrative components of an embodiment of a physics engine implemented by the content enhancement engine shown in FIG. 1. The physics engine 1200 shown in FIG. 12 provides a detailed view of one implementation of the structures which may be included in a physics engine to provide at least some of the described features. The physics engine 1200 shown in FIG. 12 is a simplified example in which extraneous elements have been omitted to focus the reader on the specific features discussed. Elements may be added, modified, or removed without departing from the intended scope of what constitutes a physics engine.

Generating a physics model can be a two-step process. First, the content for which the physics model is provided is analyzed to identify the attributes of the item that can be represented in the physics model. For example, the texture of an item may be represented in the physics model with haptic feedback. As another example, the elasticity of an item can be represented using video content whereby low elasticity is represented by slower playback of the video content and high elasticity is represented by more rapid playback. A physics modeling unit 1215, as described in further detail below, may be included in the physics engine 1200 to analyze the content.

In some implementations, the analysis of the attributes of the item that can be represented may be performed using the content enhancement workstation 170 and the item physics sensor 175. In such implementations, a physics property of the item may be detected and, using the interaction with the video, audio, haptics, or other perceptible outputs, that physics property may be simulated.

In some implementations, the analysis may include analyzing the content. As an example, for video showing a zipper being unzipped to a stopping point, computer vision algorithms (e.g., feature/object detection and tracking) may be used to scan images included in the video for the position of the hand. The position of the hand may be tracked for a sequence of images for the video. The locations of the hand may be recorded as part of the physics model. For example, the ending location of the hand may be identified indicating the maximum unzipped position for the zipper. The ending location of the hand may be coordinated with a time point or frame of the video. When the user interacts with the enhanced content, the system can map the user's finger position to the video frames of the zipper, and when the ending location is reached, the video stops playing frames. In some implementations, a haptic “kick” may be provided to simulate the stopping point of the zipper. When the user scrubs their finger in the reverse direction on the touch screen, the frames of video may be played back in reverse at a speed mapping to the speed of the finger.
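A simplified sketch of mapping the user's finger position to frames of the unzip video, assuming the tracked hand locations have been reduced to a top coordinate and a stopping coordinate along the zipper (the names and values are illustrative):

```python
def zipper_frame_for_touch(touch_y, zipper_top_y, zipper_stop_y, total_frames):
    # Map the finger's position along the zipper to a frame of the unzip video.
    # Frame 0 is fully zipped; the last frame is the maximum unzipped position
    # recorded from the tracked hand location.
    span = max(zipper_stop_y - zipper_top_y, 1)
    progress = min(max((touch_y - zipper_top_y) / span, 0.0), 1.0)
    frame = int(progress * (total_frames - 1))
    # Signal a haptic "kick" when the stopping point of the zipper is reached.
    at_stop = progress >= 1.0
    return frame, at_stop

# Scrubbing downward advances frames; scrubbing back up plays them in reverse.
print(zipper_frame_for_touch(touch_y=480, zipper_top_y=200, zipper_stop_y=500, total_frames=90))
```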

As another example, for simulating an elastic interaction, video may be recorded showing two hands stretching fabric. One or more item physics sensors (e.g., tension sensors) on the fabric may be used to record the amount of tension in the fabric. This tension data can be mapped to the corresponding frames of the video (e.g., the beginning of the video being associated with the most relaxed state and the end of the video being associated with the most tension), and included in the physics model.

In some instances, the tension values and their changes over time (e.g., curves or functions or averages) can be mapped to additional output such as haptic patterns and intensities or to sounds, for inclusion in the physics model. When the user interacts with the enhanced content, the video frames may be presented (e.g., played back) in accordance with the user's finger positions, and the physics model might require a harder press or harder pull to move through the corresponding video frames (giving the impression of tension in the fabric). At the same time, the user can feel the haptic buzz change as the tension increases or decreases, or they might hear an audible tone that changes in accordance with the tension in the fabric.
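A small sketch of mapping recorded tension data to playback position and haptic intensity, assuming the per-frame tension readings are non-decreasing from the relaxed to the stretched state (the sample values are illustrative):

```python
import bisect

# Illustrative tension readings recorded per video frame, from the most relaxed
# state (first frame) to the most stretched (last frame); assumed non-decreasing.
FRAME_TENSIONS = [0.0, 0.4, 0.9, 1.5, 2.2]

def frame_and_haptics_for_pull(pull_force, frame_tensions=FRAME_TENSIONS, max_haptic=1.0):
    # Advance only as far through the video as the pull force "earns", so stiffer
    # fabric (higher recorded tension) needs a harder pull, and scale the haptic
    # buzz with the tension at the presented frame.
    frame = max(bisect.bisect_right(frame_tensions, pull_force) - 1, 0)
    peak = frame_tensions[-1] or 1.0
    return frame, max_haptic * frame_tensions[frame] / peak

print(frame_and_haptics_for_pull(1.0))  # a moderate pull reaches frame 2
```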

A physics property can be modeled in a variety of ways. For example, friction can be mapped to requiring harder or faster touch inputs; changes in light as a product is tilted towards and away from a directed light source can be captured in a video and then video frames can be mapped to accelerometer input; and the motion of heavy fabric moving in response to a breeze can be captured with video and mapped to the intensity of input of blowing on a phone microphone. Models simulating these features may include time-based motion mapping to video frames. The models may also enhance a video of a “product interaction capture” in combination with item physics sensing (e.g., tension, light, sound, or motion capture). The model may then provide a mapping of the sensed data to one or more simulated outputs (e.g., haptics, audio, video) coordinated in time (e.g., how fast the playback occurs, when the playback occurs) and/or space (e.g., where the interaction is detected, intensity of interaction detected, etc.) with the video content.

A physics model may be provided to simulate a physics property of an item. The physics model may include information to cause presentation of one or more outputs simulating the physics property. For example, to simulate a surface feature of an item, such as a bump or surface friction, the physics model may indicate that haptic outputs are to be presented. Other examples of the types of simulations are described below. The physics property of the item may depend on a physical attribute of the item. For example, consider a clothing item. The physics property of collar stiffness may be based, in part, on the material with which the collar is made. Physical attributes of an item may include the material forming the item or the size of the item. Other physics properties that may be simulated by the physics model include magnetic properties, thermal properties, or optical properties of the item.

To simulate the physics of an item, a physics model may include an audio physics model. An audio physics model may be generated from audio data representing the item to simulate a physics property of the item. For example, a brush may be associated with audio data representing the sound of the bristles of the brush. The sound of the bristles may be presented to provide one representation of the brush. To enhance the presentation of the item, the sound of the bristles may be represented using haptic output. For example, if the bristles are stiff, the audio physics model may cause generation of a sharp haptic output, whereas if the bristles are soft, the audio physics model may cause a haptic output having a lower intensity than the sharp haptic output to be generated. The audio physics model may indicate timing of the presentation of the output, strength (e.g., haptic intensity) of the output, presentation rate for the output, and duration of the output. The audio physics model may be generated based at least in part on analysis of the audio data as will be described below.

A video physics model may be generated from video data representing the item to simulate the physics of the item. Returning to the brush example, if a user were to stroke the bristles of the physical item, the amount of effort needed for the stroke would be dependent on the stiffness of the bristles. A video of the brush may include a series of images showing a finger stroking the brush. Without enhancement, it is difficult if not impossible for a viewer to assess how much effort is needed for the stroking shown in the video. However, this physics property of the bristles can be simulated by, for example, modeling a level of user interaction to trigger presentation of the video. For example, the video may be associated with a touch event user interaction. If the physics property of the bristles is stiffness and the bristles for the item shown in the video are rigid, to simulate the effort to stroke the physical item, a touch event interaction with a high threshold may be defined. For example, the touch event interaction may specify a pressure for the touch event. The pressure indicates how much pressure the user causing the interaction applied. The model may include a pressure threshold for presenting the video. The pressure threshold may be higher for more rigid items to simulate the resistance of the item. By comparing the pressure for the touch event interaction with the pressure threshold, the presentation of the video based on the touch event interaction can thus include a tactile experience for the user to simulate the physics property of the item. As another example, the weight of a fabric may be simulated using audio data received by a microphone. For example, the user may blow into the microphone. The timing and/or orientation of the playback of a video of the fabric used for the item shown in the video may be coordinated based on the audio data and/or the direction associated with the blowing. In this way, for lighter fabrics, audio data indicating a soft blow may be sufficient to cause playback showing the fabric moving.

The physics model may be associated with additional content. For example, it may be desirable to include audio, video, image, or textual content to further simulate the physics property of the item. For example, a video may show a button on a device. The video may not include sound representing the button press or release. However, it may be desirable to include additional content, associated by the physics model with a portion of the video, to provide an audio cue that the button is pressed or released. In such instances, the physics model may identify a frame of video showing the button in a fully pressed state. The identification may be performed using image analysis of the images included in the video. The physics model may then associate audio data including a clicking sound with the frame of video showing the button in a fully pressed state. With such a physics model, the presentation of the clicking sound may be synchronized with the video to provide a simulation of the button.
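
As a rough illustration of this association, the sketch below keys a supplemental audio clip to the frame index at which image analysis reports the button fully pressed. The frame-detection step is stubbed out; `find_fully_pressed_frame` and the field names are assumptions for illustration, not the disclosed implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class SupplementalAudio:
    clip_uri: str      # e.g., a click sound drawn from a content library
    frame_index: int   # frame at which to present the clip

@dataclass
class PhysicsModel:
    # Maps frame indices in the video to supplemental content presented there.
    audio_cues: Dict[int, SupplementalAudio] = field(default_factory=dict)

def find_fully_pressed_frame(frames: List[int]) -> int:
    """Placeholder for the image analysis that locates the fully pressed state."""
    return len(frames) // 2  # assumption: midpoint stands in for the detected frame

frames = list(range(60))                      # stand-in for decoded video frames
model = PhysicsModel()
press_frame = find_fully_pressed_frame(frames)
model.audio_cues[press_frame] = SupplementalAudio("library://click.wav", press_frame)
```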

The physics model may include associations between presentation points and interactions with the content showing the item. For example, for video content showing an item, the presentation points may be specified as a frame or time within the video content. The physics model may associate (i) a user interaction at a first location within the content with (ii) initiation of presentation of the content. The physics model may associate (i) a user interaction with the content at a location within the content with (ii) a presentation location of the content.

The physics model may define a user interaction by specifying at least one of a type of user interaction (e.g., touch, click, drag, gesture, manipulation of the device (e.g., rotation, shake, etc.), and the like) or a quantity of user interaction (e.g., pressure, speed, distance, and the like for the user interaction). As noted above, the location of the user interaction may also be included in the physics model. Accordingly, the physics model can be used by an access device to detect user interactions and cause presentation of content in response to the detected user interactions, whereby the presentation simulates the physics property of the item.
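
For illustration only, the elements just described (interaction type, quantity, location, and the presentation point they drive) might be captured in a structure like the following. The class and field names are assumptions, not terms from the disclosure.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Tuple

class InteractionType(Enum):
    TOUCH = "touch"
    CLICK = "click"
    DRAG = "drag"
    GESTURE = "gesture"
    DEVICE_MOTION = "device_motion"   # rotation, shake, etc.

@dataclass
class InteractionSpec:
    """What the user must do, and how much of it, to drive presentation."""
    interaction_type: InteractionType
    location: Tuple[float, float]   # normalized (x, y) within the content
    min_quantity: float             # pressure, speed, or distance threshold

@dataclass
class PresentationBinding:
    """Ties an interaction to a presentation point (frame or time) in the content."""
    interaction: InteractionSpec
    presentation_point: float       # frame index or seconds into the content
```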

Some physics models may provide a simulation of a physics property of an item. In some implementations, the physics model may provide a simulation of a portion of an item. This portion of the item may show a key object included in the item. The key object may be a discrete component of the item such as a zipper, button, or other functional structure.

Once the item shown in the content has been modeled, physics modeling information to configure the access device 200 (or the item physics engine 226 associated therewith) to present a simulation of the physics of the item represented by the physics model is generated, such as by a physics simulation generator 1220. As each access device may have different output devices (e.g., audio output, display, haptic, etc.), the physics model generated for an item shown in content may be translated to specific configuration commands in consideration of the access device 200 and/or the output devices associated therewith. The physics simulation generator 1220, as described in further detail below, may be included in the physics engine 1200 to generate the physics modeling information for an access device based on the physics model.

A physics processor unit 1240 may be included to coordinate the functions of the physics engine 1200. For example, the physics processor unit 1240 may be configured to coordinate the processing of requests for physics modeling, generating the physics model, and transmitting the physics model.

The physics engine 1200 shown in FIG. 12 may include a network input/output interface (“network I/O”) 1235. The network I/O 1235 may be configured to transmit and receive communications via a network such as the Internet, a local area network, a wide area network, a peer-to-peer network, a cellular network, a satellite network, a public switched telephone network, any combination thereof, or other communication system for exchanging data between electronic devices. In some implementations, the network I/O 1235 is configured to transmit request messages for access device information, receive messages identifying content for physics modeling, and transmit messages including physics model information, as will be described in further detail below.

Messages including requests for physics modeling information for content may be received via the network I/O 1235 and provided to the physics modeling unit 1215. In the implementation shown in FIG. 12, the physics modeling unit 1215 is implemented as instructions residing in a memory 1250 that, when executed, cause the physics engine 1200 to perform the functions described. In some implementations, the physics modeling unit 1215 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The physics modeling unit 1215 may receive content as an input and provide a physics model for an item shown in the content as an output. The physics modeling unit 1215 may be configured to obtain the content showing an item for which the physics model is being generated. The physics modeling unit 1215 may be configured to obtain additional content for inclusion in the physics model. For example, it may be desirable to include audio output during an interaction, such as the sound of a zipper zipping or unzipping. The additional content may be obtained by transmitting a request for the additional content to the catalog system 110 and receiving a response including the requested additional content or an identifier therefor. In some implementations, the system may include a library of content (e.g., video clips, sound clips, etc.) from which such additional content could be obtained. The request may include an identifier for the item shown in the enhanced content. The request may include a desired content type (e.g., audio, video, image, text, haptic) for generating the physics model.

FIG. 13 is a flow diagram depicting an illustrative routine executed by the physics engine for generating a physics model simulating a physics property of an item shown in the content. The process 1300 shown in FIG. 13 illustrates how the physics modeling unit 1215 may generate a physics model for an item shown in content. The process 1300 shown in FIG. 13 may be implemented in whole or in part by a physics engine such as discussed in FIGS. 1 and 12 above. Certain aspects of the process 1300 may be implemented by the physics modeling unit 1215 of a physics engine. In some implementations, aspects of the process 1300 may be implemented by or in conjunction with a content enhancement workstation such as shown in FIG. 1 and discussed above.

The process 1300 begins at block 1302. At block 1305, content showing an item for modeling is received. The content may be provided in a message received from the catalog system 110 or the access device 200 by the physics engine 1200. The message may include the content and/or a reference to the content. In cases where the content is provided by reference (e.g., URL, file name, content identifier), receiving the content at block 1305 may include obtaining the content by transmitting a request for the content based on the reference.

At block 1310, a key object of the item represented in or by the content is identified. The key object is a portion of the item for which physics will be modeled. Table 1 below includes examples of key objects that may be detected for corresponding item types.

TABLE 1

Item Type            Example Key Objects
Shirt                Button; Collar; Zipper; Embroidery
Electronic Device    Button; Display; Accessory Port
Brush                Bristles; Handle

Where the content includes image content, the key object may be identified using object recognition analysis of the image content. The object recognition analysis may identify objects shown in the image content. The identification may include identifying how much of the image content is devoted to certain objects. In some implementations, the object shown in the majority of the image content may be identified as the key object. In some implementations, the item type may be used to direct the identification process. For example, if the type for the item shown in the content is a brush, the key object identification can focus on identifying bristles or handle as the key object. The key objects associated with particular item types can be specified via a configuration such as a physics configuration provided to the physics engine 1200.

Where the content includes video content, the key object may be identified using object recognition analysis of the images included in the video content. This analysis may be similar to the analysis performed for image content discussed above. In some implementations, the key object may be identified by identifying one or more locations within the video content that change between the images included in the video content. For example, a location may be identified such as described with reference to FIG. 10 above. Once the location 1055 is identified, the object detection may be performed to detect objects within the video content at the location 1055 rather than processing the entire frame of video content. Motion detection, such as described with reference to FIG. 11, may be used in addition or as an alternative for identifying the key object.

In some implementations, the content enhancement workstation 170 may identify or assist in the identification of the key object. In such implementations, the content enhancement workstation 170 may generate and present an interface displaying the content. The interface may also include control elements to allow selection of an area of the content including the key object. For example, the control element may include a selection tool (e.g., lasso, circle selector, bounding box selector, etc.) that can be used to identify the area around a key object. Upon receiving the selection, such as via an input device coupled with the content enhancement workstation 170, the selection may be transmitted to the content enhancement engine 1700 for further processing. The area may identify the key object or provide assistance to the detection of the key object by identifying a smaller portion of the content to search for the key object.

At block 1315, a physics property may be detected for the key object. The physics property may be detected based on the identification of the key object. For example, if the key object was identified as a button for an electronic device, a physics property of the button may be the sound the button makes when pressed or the amount of pressure needed to activate the button. If the item is a clothing item, the physics property may include the fabric type for the item. The fabric type may be associated with an elasticity. The more elastic the fabric is, the faster the item would return to a resting state. The physical properties may be associated with item information maintained by the catalog system 110. In such implementations, the physics modeling unit 1215 may query the catalog system 110 to obtain the physical properties of an item using an identifier for the item shown in the content. As discussed above, detection of the physics property may include receiving physics data from an item physics sensor such as the item physics sensor 175 shown in FIG. 1.

At block 1320, a determination is made as to whether the content includes audio content. The determination may be based on metadata for the content received, such as file name, file extension, or file properties. The physics configuration may include a list of metadata and associated audio content determinations. For example, if the file extension of the file including the content is WAV, the content is likely to include audio content.
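
A simple extension-to-content-type lookup of the kind this determination (and the analogous video determination at block 1335) describes might look like the following sketch. The extension lists are illustrative assumptions, not a mapping taken from the physics configuration.

```python
import os

# Illustrative mapping of file extensions to the content types they suggest;
# a physics configuration could supply or override these lists.
AUDIO_EXTENSIONS = {".wav", ".mp3", ".flac"}
VIDEO_EXTENSIONS = {".avi", ".mp4", ".mov"}

def likely_has_audio(filename: str) -> bool:
    ext = os.path.splitext(filename)[1].lower()
    # Video containers commonly carry an audio track as well.
    return ext in AUDIO_EXTENSIONS or ext in VIDEO_EXTENSIONS

def likely_has_video(filename: str) -> bool:
    return os.path.splitext(filename)[1].lower() in VIDEO_EXTENSIONS

print(likely_has_audio("brush_demo.wav"))  # True
print(likely_has_video("zipper.avi"))      # True
```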

If the determination at block 1320 is affirmative, at block 1325, the audio content is extracted from the content. For example, where the content received for modeling is video content, the audio content may be separated from the image content. This allows modeling of the audio content separately from the image content. A codec may be included to decode the audio portion of the content and encode an audio-only file.

Once the audio content is obtained, at block 1330, an audio physics model may be generated based at least in part on one or more of the audio and the physics property of the key object. One way the audio content may be modeled is by providing haptic feedback corresponding to the audio content. For example, if the key object makes a sharp sound, such as a button snapping or a pop when a kitchen container opens, the sharpness may be simulated by causing haptic feedback synchronized with the audio content at the point of the sharp sound. The timing of when to present the haptic feedback as well as a quantity of haptic feedback to provide may be specified via the audio physics model. To identify the sound and generate modeled output, the waveform of the audio may be analyzed. One example will be described below with reference to FIG. 14.

As another example, where the key object shown in the content is under high friction, the audio content may include audio data for the scraping that typically accompanies a high friction activity. Based on the analysis of the waveform of the audio content, the friction may be used to oscillate display of the image content at a frequency that corresponds to the friction to provide a visual representation of the physical property of the key object shown. The quantity and frequency of the oscillation may be included in the audio physics model.
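
For illustration, the oscillation could be expressed as a small displacement applied to the displayed image over time. The sketch below is an assumption-laden illustration (the amplitude and the mapping from friction to frequency are invented values), not the disclosed method.

```python
import math

def display_offset(time_s: float, friction: float) -> float:
    """Horizontal pixel offset to apply to the displayed image at time_s.

    Higher friction maps to a higher oscillation frequency; the 2-pixel
    amplitude and 2-10 Hz range are illustrative choices only.
    """
    frequency_hz = 2.0 + 8.0 * max(0.0, min(1.0, friction))
    amplitude_px = 2.0
    return amplitude_px * math.sin(2 * math.pi * frequency_hz * time_s)

# Sample the offset for a high-friction scrape at a few display refreshes (60 Hz).
print([round(display_offset(t / 60, friction=0.9), 2) for t in range(4)])
```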

FIG. 14 is a graph diagram depicting an example waveform of audio data for sound captured for the item and thresholds for haptic feedback that may be used by the physics engine to generate the physics model. As discussed above, the physics model may include an audio physics model. FIG. 14 includes a waveform 1400 for audio data obtained from audio content. In the example of FIG. 14, the audio physics modeling is performed based on thresholds shown as x and −x. The value of the thresholds may be specified in the physics configuration. The thresholds may identify a decibel level above which haptic feedback will be triggered. Accordingly, at a time where the waveform 1400 exceeds the threshold, the audio physics model may include information to generate an output such as a haptic output.

In addition to identifying when to initiate the output, the analysis of the waveform 1400 in contrast to the threshold may also provide a quantity of output to provide. For example, the quantity of haptic output provided for a peak may be directly correlated with a difference between the peak and the threshold. In such implementations, louder sounds may be represented with more haptic output.
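
The threshold comparison of FIG. 14 can be sketched as a pass over audio samples that emits a haptic event wherever the magnitude crosses the symmetric threshold, with intensity scaled by how far the peak exceeds it. This is a minimal illustration under assumed names (`HapticEvent`, samples normalized to the range -1 to 1); the actual analysis could instead operate on decibel levels as described above.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class HapticEvent:
    time_s: float       # when to trigger the haptic output
    intensity: float    # 0..1, scaled from how far the peak exceeds the threshold

def haptic_events_from_waveform(samples: List[float],
                                sample_rate: int,
                                threshold: float) -> List[HapticEvent]:
    """Emit a haptic event wherever |sample| exceeds the symmetric threshold (x, -x)."""
    events = []
    for i, s in enumerate(samples):
        excess = abs(s) - threshold
        if excess > 0:
            # Louder peaks (larger excess) map to stronger haptic output.
            intensity = min(1.0, excess / (1.0 - threshold))
            events.append(HapticEvent(time_s=i / sample_rate, intensity=intensity))
    return events

# Usage with a toy waveform: only the loud spike exceeds the 0.6 threshold,
# producing one event at t = 0.5 s with intensity of roughly 0.75.
events = haptic_events_from_waveform([0.1, 0.2, 0.9, 0.3], sample_rate=4, threshold=0.6)
print(events)
```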

The audio physics modeling for the waveform 1400 may also indicate a duration for the output. For example, if the waveform 1400 includes a steady sound, such as an alarm clock tone, the output may be continued until the end of the tone.

The physics property of the key object may be used for audio physics modeling. For example, if the key object is elastic, the speed with which a rubber band would snap back is faster than the pop of a kitchen container as the vacuum seal is released. Accordingly, the duration for the output as specified in the audio physics model may be adjusted based on the physics property of the key object.

After generating the audio physics model at block 1330, or if the determination at block 1320 is negative, at block 1335 a determination is made as to whether the content representing the item being modeled includes video.

The determination may be based on metadata for the content received, such as file name, file extension, or file properties. The physics configuration may include a list of metadata and associated video content determinations. For example, if the file extension of the file including the content is AVI, the content is likely to include video content.

If the determination is no, at block 1355, any models generated by the process 1300 are stored in association with the content. The models may include an audio physics model generated at block 1330. The models may be stored in a physics model storage such as the physics model storage 1295 shown in FIG. 12. In some implementations, the models may be stored in association with an identifier for the item or key object shown in the content.

Returning to block 1335, if it is determined that the content includes video content, at block 1340, the video images are extracted from the content. For example, the audio content may be separated from the images included in video content. This allows modeling of the item as represented in the audio content separately from the representation of the item via the images. A codec may be included to decode the image portion of the content and encode an image-only file.

At block 1345, key object movement in the video content may be detected. The movement may be detected using optical flow such as described above with reference to FIG. 11. Movement may be detected by comparing two or more images extracted from the content. In such implementations, an initial location of the key object shown in a first image may be identified. A subsequent location of the key object may be identified in a second image. By comparing the initial location with the subsequent location, the movement of the key object can be determined (e.g., interpolated between the initial location and the subsequent location).
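
One way to picture this comparison is a per-step interpolation between the key object's detected locations in two frames. The sketch below assumes bounding-box centers have already been found by object recognition; the helper name and coordinates are illustrative.

```python
from typing import List, Tuple

Point = Tuple[float, float]

def interpolate_movement(initial: Point, subsequent: Point, steps: int) -> List[Point]:
    """Estimate intermediate key-object locations between two detected positions."""
    (x0, y0), (x1, y1) = initial, subsequent
    return [(x0 + (x1 - x0) * t / steps, y0 + (y1 - y0) * t / steps)
            for t in range(steps + 1)]

# Usage: a zipper pull detected at (50, 20) in the first frame and (50, 180)
# in a later frame, with three intermediate estimates.
path = interpolate_movement((50, 20), (50, 180), steps=4)
print(path)  # [(50.0, 20.0), (50.0, 60.0), (50.0, 100.0), (50.0, 140.0), (50.0, 180.0)]
```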

At block 1350, a video physics model may be generated using video images, key object movement, and a physics property of the key object. For example, consider video content of a zipper being unzipped. At the start of the video content, the zipper is fastened, and as the video content is played back, the zipper is unfastened to an ending location shown in a final frame of the video content. Physical properties may be directly correlated with user interactions specified in the video physics model. For example, physical properties of the zipper may include how tightly the zipper teeth clasp. This may correspond to a zipper requiring more force to unzip than a looser zipper. Physical properties may be inversely correlated with user interactions specified in the video physics model. For example, physical properties of the zipper may include the elasticity of the fabric near the zipper. If the fabric is stiff, as the zipper unzips, the surrounding fabric may not move as quickly as a softer, suppler material. The tightness and/or elasticity of the key object can be modeled by how quickly the video content is played back. As part of generating the video physics model, the playback may be correlated with an interaction calibrated to simulate the physics property of the zipper. For example, the magnitude of interaction needed via an access device (e.g., touchscreen) to cause playback of the video content may be proportional to the force needed to unzip the zipper of the physical item.

FIG. 15 is a flow diagram depicting an illustrative routine executed by the physics engine for associating interactions with portions of the content showing the item modeled by the physics model for simulating a physics property of the item. The process 1500 shown in FIG. 15 illustrates how the physics modeling unit 1215 may generate a video physics model for content to represent the physics property of the item. The process 1500 in FIG. 15 illustrates how interactions such as touch, click, eye tracking, etc. are associated with playback of the video content. While the interactions may be with a representation of the item via an access device, the interactions may simulate a physical interaction with the item.

The process 1500 shown in FIG. 15 may be implemented in whole or in part by a physics engine such as discussed in FIGS. 1 and 12 above. Certain aspects of the process 1500 may be implemented by the physics modeling unit 1215 of a physics engine. In some implementations, aspects of the process 1500 may be implemented by or in conjunction with a content enhancement workstation such as shown in FIG. 1 and discussed above.

The generation of the video physics model begins at block 1502. The process 1500 assumes video content showing an item is provided for modeling. At block 1505, a starting location for the movement of the key object is identified. In some implementations, the identification may include object recognition of the key object in an initial frame of the video content. The key object recognition may be based on one or more of an analysis of the pixels in the initial frame or segmentation of the video frame. The identification may be provided by or assisted by information provided by the content enhancement workstation 170. For example, the content enhancement workstation 170 may receive the video content and present an interface for selecting an area of the video content showing the key object.

The starting location identified at block 1505 represents where a user interaction with the content should begin to activate the playback of the content. At block 1510, an initiation interaction location and magnitude are associated with the starting location. The interaction magnitude may indicate how vigorous the interaction must be to initiate the playback. The magnitude, as discussed above, can represent stiffness of a feature such as a zipper or button. It may be desirable to include a minimum threshold interaction magnitude. This can avoid triggering playback based on errant interactions.

Having determined where within the content for the item being modeled the interaction begins, and how much interaction at the initiation location is needed to activate playback of the content, at block 1515, an ending location for the movement of the key object is identified. In some implementations, the identification may include object recognition of the key object in a frame at the end of the video content (e.g., the last frame showing the item). The key object recognition may be based on one or more of an analysis of the pixels in the video frame or segmentation of the video frame. The interaction initiation location or magnitude may be identified based on physics properties detected by an item physics sensor such as the item physics sensor 175 of FIG. 1.

At block 1520, frames of the video content are associated with an interaction path from the starting location to the ending location. The association models the location of the interaction along the path of movement to a frame of the video content. For example, when a finger swipe is defined as the interaction with a zipper, as the finger swipe is dragged to a location half way between the starting location and the ending location, the video content should have a playback position of a frame of video information showing the half-unzipped zipper. In this way, the interaction with the video content is correlated with playback to show the item in an expected state had the user interacted with the physical item (e.g., half-unzipped). In some implementations, the content enhancement workstation 170 may present an interface for receiving information identifying the interaction path. For example, the interface may present a static image of a frame of video content. The interface may include a control element (e.g., button, link, etc.) to activate capturing of an interaction for the content. Once activated, the content enhancement workstation 170 may record interactions such as selections, swipes, sound, movement, etc. with the content. The recorded interactions may then be associated with the initiation interaction for the content.
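
The frame-to-path association at block 1520 can be thought of as mapping the drag's progress along the interaction path to a playback position. The following is a rough sketch under assumed names; a real implementation would track the touch location reported by the access device.

```python
def frame_for_drag_progress(drag_point: float,
                            start: float,
                            end: float,
                            total_frames: int) -> int:
    """Map a drag position along the interaction path to a video frame index.

    A drag halfway between start and end yields the frame showing, e.g.,
    a half-unzipped zipper.
    """
    progress = (drag_point - start) / (end - start)
    progress = max(0.0, min(1.0, progress))          # clamp to the path
    return round(progress * (total_frames - 1))

# Usage: a 120-frame unzipping clip; a drag halfway along the path selects
# roughly the middle frame.
print(frame_for_drag_progress(drag_point=100, start=20, end=180, total_frames=120))  # 60
```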

At block 1525, the magnitude of the interactions (e.g., the initiation interaction or the interactions along the interaction path) may be adjusted to further model a physics property of the key object. For example, consider a two-phase locking lid for a mug. An initial interaction may require a first magnitude touch interaction to seat the lid. Once seated, a second magnitude touch interaction may be specified to simulate the second locking phase. The playback of video content showing such a lid may stop after video frames showing the first phase are presented, until the second magnitude touch interaction is detected. In this way, a user can “experience” the two-phase locking via an access device. The magnitude may be adjusted based on physics properties detected by an item physics sensor such as the item physics sensor 175 of FIG. 1.

At block 1530, a determination is made as to whether haptic output may be provided for the physics property. The determination may be based on the physics configuration, whereby the generation of haptic output is specified via the physics configuration for certain access devices, access device types, items, key objects, physics properties, or other information detectable within the system. The determination may be performed based on functional characteristics of the access device that will be receiving the physics model. For example, if the access device has no haptic capabilities, it may be desirable to preserve resources and omit generating haptic output information.

If the determination at block 1530 is affirmative, at block 1535, a haptic output is associated with one or more frames of video content. In the example of the two-phase locking lid, it may be desirable to provide a haptic vibration at the end of the first phase. In such implementations, the haptic output may be associated with the playback location showing the lid at the end of the first phase of closure. This haptic “bump” can further simulate the closing of the lid. Haptic output associated with the video content may include information indicating a property of the haptic output to provide at the associated frame of video content. The information may indicate one or more of the pattern of the haptic output, the strength of haptic output, and duration for the haptic output.

After associating the haptic output at block 1535, or if the determination at block 1530 is negative, at block 1540 an interaction interrupt playback is identified. For some items, if a user stops interacting with the key object, the key object may stay in place. For example, when unzipping a zipper in the physical world, if the process is left half completed, the zipper typically stays in the half-completed state. For some items, if a user stops interacting with the key object, it may be a more accurate representation of the physical item for the key object to move to another location. For example, consider a button-down shirt. In the physical world, if the collar of the shirt is moved aside to view inside the shirt and the collar is released, it will typically return to the closed state, hiding the inside of the shirt. To model this interaction, an interrupt playback behavior is identified whereby, if a user stops interacting with the collar, the playback may rewind to show frames of video content reversing the collar to a frame with a closed view of the shirt.

Identifying the interaction interrupt playback may also include identifying a speed for playback. For example, when simulating a physics property of a shirt, if the material forming the shirt is soft, the speed at which the collar of the physical shirt would return to the closed position is slower than for a stiffer fabric. As such, the playback upon detecting an interaction disruption may be adjusted based on a physical attribute of the key object. At block 1590, the process 1500 ends.
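
As an illustration of the interrupt behavior, the sketch below chooses a rewind rate from an assumed stiffness attribute when the user releases the collar. The attribute name, rate range, and returned structure are hypothetical, not values from the disclosure.

```python
def rewind_rate_for_release(stiffness: float) -> float:
    """Playback-rate multiplier for rewinding to the closed state on release.

    Stiffer materials (stiffness near 1.0) snap back faster; softer materials
    (near 0.0) settle more slowly. The 0.5x to 3.0x range is illustrative.
    """
    min_rate, max_rate = 0.5, 3.0
    return min_rate + (max_rate - min_rate) * max(0.0, min(1.0, stiffness))

def on_interaction_interrupt(current_frame: int, stiffness: float) -> dict:
    # Describe the rewind the access device should perform: play frames in
    # reverse from the current position back to frame 0 at the chosen rate.
    return {"from_frame": current_frame, "to_frame": 0,
            "rate": rewind_rate_for_release(stiffness)}

print(on_interaction_interrupt(current_frame=45, stiffness=0.9))
# {'from_frame': 45, 'to_frame': 0, 'rate': 2.75}
```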

Returning to FIG. 13, at block 1355, the video physics model generated at block 1350 may be stored in association with the content provided for physics modeling at block 1305. The video physics model may be stored in a physics model storage such as the physics model storage 1295 shown in FIG. 12. The process 1300 ends at block 1390 but may be repeated to generate another physics model for the content received at block 1305 or for different content subsequently received.

Returning to FIG. 12, the physics model storage 1295 may be specifically architected to efficiently store and retrieve physics models for identified content. In some implementations, the physics models may be stored in association with an identifier for the content. In such implementations, the physics model storage 1295 may be indexed using the identifiers to expedite retrieval of the physics model associated therewith. In some implementations, the models may be stored in association with an identifier for the item or key object shown in the content.

The physics engine 1200 may generate the physics model for an item shown in content before a request for the content from an access device is received. For example, the catalog system 110 may receive new content showing an item. In some implementations, it may be desirable to generate the physics model such that the physics model is ready when a request for the model is received. Because the physics model storage 1295 includes a physics model for the content, responding to the request may not require generating the physics model at the time of the request.

Whether the physics model is generated prior to receiving a request from an access device or in real time, responsive to a request from an access device, physics modeling information to configure the access device 200 (or the item physics engine 226 associated therewith) to present the features represented may be generated by the physics simulation generator 1220. In the implementation shown in FIG. 12, the physics simulation generator 1220 is implemented as instructions residing in the memory 1250 that, when executed, cause the physics engine 1200 to perform the functions described. In some implementations, the physics simulation generator 1220 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The physics simulation generator 1220 may receive a physics model as an input. In some implementations, the input may indirectly reference a physics model, such as by a content identifier or item identifier, which can be used by the physics simulation generator 1220 to obtain the physics model, such as from the physics model storage 1295. The physics simulation generator 1220 may provide a machine readable configuration to adjust the access device 200 and/or the item physics engine 330 to simulate a physics property of the item shown in the content. The configuration may include modeling information such as the type of output (e.g., image, audio, haptic, etc.), location of user interactions, magnitude of user interactions, etc. The physics simulation generator 1220 may obtain the physics model for the item and generate the configuration. In some implementations, the type of interaction may also be specified via the configuration (e.g., touch, drag, click, tap, etc.).
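
A machine-readable configuration of the kind the physics simulation generator emits might be serialized roughly as follows. This is a sketch with assumed field names; it only illustrates tailoring output types to the receiving device's capabilities (for example, dropping haptic entries for a device without a haptic actuator), not the disclosed format.

```python
import json

def build_simulation_config(physics_model: dict, device_capabilities: set) -> str:
    """Produce a device-tailored, machine-readable configuration from a physics model."""
    outputs = [o for o in physics_model["outputs"]
               if o["type"] in device_capabilities]   # drop unsupported output types
    config = {
        "interaction": physics_model["interaction"],  # type, location, magnitude
        "outputs": outputs,
    }
    return json.dumps(config)

model = {
    "interaction": {"type": "drag", "location": [0.4, 0.2], "min_magnitude": 0.6},
    "outputs": [
        {"type": "video", "content_id": "zipper-clip"},
        {"type": "haptic", "frame": 30, "strength": 0.8},
    ],
}
# A display-only device receives a configuration without the haptic entry.
print(build_simulation_config(model, device_capabilities={"video", "audio"}))
```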

In some implementations, it may be desirable to tailor the physics modeling information generated by the physics simulation generator 1220 using the physics model to the access device that will be presenting the content. An access device detector 1225 may be included to receive and/or detect the access device information. Similar to the access device detector 325 shown in FIG. 3 or the access device detector 725 shown in FIG. 7, the access device detector 1225 may be configured to detect functional characteristics of an access device such as display dimensions, display orientation, an identifier for an application being used to display the enhanced content, an identifier of the item physics engine 230 associated with the access device 200, resource level for the access device (e.g., power, network bandwidth, processor bandwidth, memory, etc.), and the like.

The access device detector 1225 shown in FIG. 12 is implemented as instructions residing in the memory 1250 that, when executed, cause the physics engine 1200 to perform the functions described. In some implementations, the access device detector 1225 may be implemented as a hardware device (e.g., circuit) configured to perform the functions described.

The access device detector 1225 may detect functional characteristics based on a received request for a physics model. The request may include header information or other metadata providing information about the access device 200, the application of the access device that generated the request, and resources (e.g., network type) available to the access device 200. In some implementations, the request may be submitted to the content enhancement engine 1700 on behalf of the access device 200. For example, the catalog system 110 may receive a request from the access device 200. As part of generating a response to the request, the catalog system 110 may transmit a request for a physics model for an item shown in enhanced content stored in the item data store 126. The catalog system 110 may receive the physics model or information for retrieving the physics model and provide this to the access device 200. In some implementations, the request for the physics model may indicate the delivery of the physics model from the content enhancement engine 1700 to the access device 200. In such implementations, transmission details (e.g., IP address, protocol, security token, authentication information, encryption, etc.) may be provided in the request from the catalog system 110 to specify how and where the physics model should be delivered. In some implementations, the access device detector 1225 may provide the functional characteristics detected for the access device as an output. For example, a message including the functional characteristics detected may be provided to the physics simulation generator 1220 to tailor the output of the physics simulation generator 1220 based on the functional characteristics.

In some implementations, the access device detector 1225 may obtain additional information about the access device 200. The additional information may be obtained using information about the access device 200 included in the request, such as a device identifier or username. Based on this information, the access device 200 information can be retrieved. In some implementations, the information may be retrieved from the access device 200 based on a request from the access device detector 1225 transmitted to the access device 200. The request may include the information elements the access device detector 1225 is seeking. In some implementations, this information may be included as part of session initiation (e.g., session initiation protocol session; real time transport protocol session) between the access device 200 and the content enhancement engine 1700.

In some implementations, the access device detector 1225 may obtain functional characteristics of an access device from the catalog system 110. For example, the catalog system 110 may request the physics model and include it in a response to the access device 200. In such implementations, the catalog system 110 may store information about the access device 200 and/or its user. Such information may be provided to the access device detector 1225 as part of the request message that initiates the generation of the physics model. In some implementations, the access device detector 1225 may request this information from the catalog system 110.

In some implementations, the access device detector 1225 may identify a service provider for the access device 200. The service provider may be, for example, an internet service provider or wireless service provider. The service provider may present an interface to service requests for access device functional characteristics. In some implementations, the access device detector 1225 may provide information about the access device (e.g., IP address, device identifier, etc.) in a message requesting additional access device functional characteristics. In response, the service provider may provide all or some of the requested information.

In some implementations, the service provider may provide information to help control the flow of traffic through its network. As such, the service provider may provide information about the traffic that may be permitted for an access device rather than specific details about a particular access device. This may include the types of content that can be received by the access device 200, limits on the quantity of content that can be received by the access device 200, network speed for the access device 200 (e.g., 3G, 4G, LTE, enhanced LTE, Wi-Fi, etc.), whether the access device 200 is roaming, and the like. This information can be used by the physics engine 1200 to identify what types and quantities of physics models may be provided to and/or rendered by the access device 200.

The physics engine 1200 may further include a power source 1280. The power source 1280 may include a wired or wireless source of power sufficient to supply operational power to the elements described. In some implementations, the power source 1280 may include a battery or other rechargeable source. In some implementations, the power source 1280 may include a solar panel array.

The elements of the physics engine 1200 are coupled by a bus 1290. The bus 1290 is configured to allow the elements to exchange data (such as the messages described herein) and/or power. In some implementations, parallel busses may be included, one for data and one for power.

In some implementations, the physics engine 1200 may include a non-transitory computer-readable memory configured to store executable instructions. The memory 1250 may be configured for this purpose. In such a configuration, the physics processor unit 1240 may be in data communication with the computer-readable memory storing the executable instructions. The physics processor unit 1240 may be programmed by the executable instructions to implement the features described.

FIG. 16 is a schematic diagram of messages that may be sent between components of the content enhancement engine during an example item browsing session. The messages shown in FIG. 16 include messages exchanged between exemplary entities selected to highlight certain features related to requesting and generating enhanced content. The messages shown in FIG. 16 illustrate how the content enhancement engine 1700 may provide enhanced content and hints to the access device 200. It will be understood that fewer or additional entities may be included to achieve a similar result.

An item request message 1604 may be transmitted from the access device 200 to the catalog system 110. The item request message 1604 may include an identifier for an item or query terms that can be used to identify the item. The item request message 1604 may include information identifying the access device 200.

Via message 1606, the catalog system 110 may retrieve item data using the information included in the item request message 1604. The item data may be retrieved from the item data store 126. The item data may include descriptive information for the item (e.g., title, manufacturer, launch date, user reviews, etc.). In some implementations, the item data may include content showing the item.

The item data may be provided in a message 1608 to the content enhancement engine 1700, such as via the navigation generator 300 as shown in FIG. 16. The message 1608 may include the content or references to the content identified by the catalog system 110. The message 1608 may request generation of a navigation interface for the content showing the item. Generating the navigation interface may include the process 500 shown in FIG. 5.

To tailor the content and/or hints, message 1610 may be transmitted to the access device 200. The message 1610 may request functional characteristics for an access device. The message 1610 may include a request for functional characteristics such as display dimensions, display orientation, an identifier for an application being used to display the interface, resource level for the access device (e.g., power, network bandwidth, processor bandwidth, memory, etc.), and the like. Message 1612 may be received by the navigation generator 300. The message 1612 may include the functional characteristics provided by the access device 200. Once received by the content enhancement engine 1700, the functional characteristics may be provided to one or more of the navigation generator 300, the content hint generator 700, and/or the physics engine 1200 to tailor the respective outputs for the access device 200.

The navigation generator 300 may request, via message 1614, a hint for a main view identified while generating the navigation interface. The content hint generator 700 may generate the hint via the process 900 shown in FIG. 9. The hint may be provided to the navigation generator 300 via message 1616.

As shown in FIG. 16, the navigation generator 300 may also request a physics model for an item shown in the main view via message 1618. The physics model may be generated by the process 1300 shown in FIG. 13 and/or the process 1500 shown in FIG. 15. The physics model may be provided in a response message 1620. In some implementations, the content enhancement engine 1700 may include a processor (not shown) to coordinate the requests (e.g., message 1614 and message 1618) for the content hint generator 700 and the physics engine 1200.

Via message 1622, the item content is provided to the access device 200. The item content may include the navigation interface including a main view with hinting information. Enhanced content corresponding to the main view may also be provided that simulates the item shown in the enhanced content.

FIG. 17 is a block diagram depicting an illustrative computing device that can implement the content enhancement engine shown in FIG. 1. The computing device 1700 can be a server or other computing device, and can comprise a processing unit 1702, a content enhancement processor 1730, a network interface 1704, a computer readable medium drive 1706, an input/output device interface 1708, and a memory 1710. The network interface 1704 can provide connectivity to one or more networks or computing systems. The processing unit 1702 can receive information and instructions from other computing systems or services via the network interface 1704. The network interface 1704 can also store data directly to memory 1710. The processing unit 1702 can communicate to and from memory 1710 and output information to an optional display 1718 via the input/output device interface 1708. The input/output device interface 1708 can also accept input from the optional input device 1720, such as a keyboard, mouse, digital pen, microphone, mass storage device, etc.

The memory 1710 contains computer program instructions that the processing unit 1702 executes in order to implement one or more embodiments. The memory 1710 may include RAM, ROM, and/or other persistent, non-transitory computer readable media. The memory 1710 can store an operating system 1712 that provides computer program instructions for use by the processing unit 1702 or other elements included in the computing device in the general administration and operation of the computing device 1700. The memory 1710 can further include computer program instructions and other information for implementing aspects of the present disclosure.

For example, in one embodiment, the memory 1710 includes a content enhancement processing configuration 1714. The content enhancement processing configuration 1714 may include the thresholds, navigation configuration, catalog system location (e.g., URL), content hint configuration, physics configuration, automatic speech recognition system, and other configurable parameters to dynamically adjust the content enhancement processor 1730 and/or the computing device 1700 to process item requests and generate enhanced content and interfaces as described above. The content enhancement processing configuration 1714 may store specific values for a given configuration element. For example, a specific threshold value may be included in the content enhancement processing configuration 1714. The content enhancement processing configuration 1714 may, in some implementations, store information for obtaining specific values for a given configuration element, such as from a network location (e.g., URL).

The memory 1710 may also include or communicate with one or more auxiliary data stores, such as data store 1722. The data store 1722 may electronically store data regarding the composite images, extracted images, extraction comparisons for specific item types, authorization verdicts, and the like.

The elements included in the computing device 1700 may be coupled by a bus 1790. The bus 1790 may be a data bus, communication bus, or other bus mechanism to enable the various components of the computing device 1700 to exchange information.

In some embodiments, the computing device 1700 may include additional or fewer components than are shown in FIG. 17. For example, a computing device 1700 may include more than one processing unit 1702 and computer readable medium drive 1706. In another example, the computing device 1700 may not be coupled to a display 1718 or an input device 1720. In some embodiments, two or more computing devices 1700 may together form a computer system for executing features of the present disclosure.

Depending on the embodiment, certain acts, events, or functions of any of the processes or algorithms described herein can be performed in a different sequence, can be added, merged, or left out altogether (e.g., not all described operations or events are necessary for the practice of the algorithm). Moreover, in certain embodiments, operations or events can be performed concurrently, e.g., through multi-threaded processing, interrupt processing, or multiple processors or processor cores or on other parallel architectures, rather than sequentially.

The various illustrative logical blocks, modules, routines, and algorithm steps described in connection with the embodiments disclosed herein can be implemented as electronic hardware, or as a combination of electronic hardware and executable software. To clearly illustrate this interchangeability, various illustrative components, blocks, modules, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware, or as software that runs on hardware, depends upon the particular application and design constraints imposed on the overall system. The described functionality can be implemented in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosure.

Moreover, the various illustrative logical blocks and modules described in connection with the embodiments disclosed herein can be implemented or performed by a content enhancement system or other hardware included in a content enhancement system. The content enhancement system or other hardware included in a content enhancement system may include a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA), a microprocessor, a controller, microcontroller, or other programmable logic element, discrete gate or transistor logic, discrete hardware components, or any combination thereof. For such content enhancement systems, devices used to implement the system are specially designed to perform the content collection, detection, and enhancements described herein. A content enhancement system may include electrical circuitry configured to process specific computer-executable instructions to perform one or more of the features described herein. In embodiments where the content enhancement system includes an FPGA or similar programmable elements, the content enhancement system may provide one or more of the described features without processing computer-executable instructions, instead configuring the FPGA or similar programmable element to perform the recited features. Although described herein primarily with respect to digital technology, a content enhancement system may also include primarily analog components. For example, some or all of the content enhancement processing features described herein may be implemented in analog circuitry or mixed analog and digital circuitry.

The elements of a method, process, routine, or algorithm described in connection with the embodiments disclosed herein can be embodied directly in content enhancement system hardware, in a software module executed by a device included in the content enhancement system, or in a combination of the two. A content enhancement software module can reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or similar form of a non-transitory computer-readable storage medium. An exemplary storage medium can be coupled to the content enhancement system such that the content enhancement system can read information from, and write information to, the storage medium. In the alternative, the storage medium can be integral to the content enhancement system. The content enhancement system and the storage medium can reside in an ASIC. The ASIC can reside in a device configured to capture or process data such as a microphone, a smartphone, a set-top-box, a tablet computer, a desktop computer, or the like. In the alternative, the content enhancement system and the storage medium can reside as discrete components (e.g., sub-system) in a device configured to collect, detect, and enhance content as described herein.

Conditional language used herein, such as, among others, “can,” “could,” “might,” “may,” “e.g.,” and the like, unless specifically stated otherwise, or otherwise understood within the context as used, is generally intended to convey that certain embodiments include, while other embodiments do not include, certain features, elements, and/or steps. Thus, such conditional language is not generally intended to imply that features, elements, and/or steps are in any way required for one or more embodiments or that one or more embodiments necessarily include logic for deciding, with or without other input or prompting, whether these features, elements, and/or steps are included or are to be performed in any particular embodiment. The terms “comprising,” “including,” “having,” and the like are synonymous and are used inclusively, in an open-ended fashion, and do not exclude additional elements, features, acts, operations, and so forth. Also, the term “or” is used in its inclusive sense (and not in its exclusive sense) so that when used, for example, to connect a list of elements, the term “or” means one, some, or all of the elements in the list.

Disjunctive language such as the phrase “at least one of X, Y, Z,” unless specifically stated otherwise, is otherwise understood within the context as used in general to present that an item, term, etc., may be either X, Y, or Z, or any combination thereof (e.g., X, Y, and/or Z). Thus, such disjunctive language is not generally intended to, and should not, imply that certain embodiments require at least one of X, at least one of Y, or at least one of Z to each be present.

Unless otherwise explicitly stated, articles such as “a” or “an” should generally be interpreted to include one or more described items. Accordingly, phrases such as “a device configured to” are intended to include one or more recited devices. Such one or more recited devices can also be collectively configured to carry out the stated recitations. For example, “a processor configured to carry out recitations A, B and C” can include a first processor configured to carry out recitation A working in conjunction with a second processor configured to carry out recitations B and C.

As used herein, the terms “determine” or “determining” encompass a wide variety of actions. For example, “determining” may include calculating, computing, processing, deriving, generating, obtaining, looking up (e.g., looking up in a table, a database or another data structure), ascertaining, and the like via a hardware element without user intervention. Also, “determining” may include receiving (e.g., receiving information), accessing (e.g., accessing data in a memory), and the like via a hardware element without user intervention. Also, “determining” may include resolving, selecting, choosing, establishing, and the like via a hardware element without user intervention.

As used herein, the terms “provide” or “providing” encompass a wide variety of actions. For example, “providing” may include storing a value in a location of a storage device for subsequent retrieval, transmitting a value directly to the recipient via at least one wired or wireless communication medium, transmitting or storing a reference to a value, and the like. “Providing” may also include encoding, decoding, encrypting, decrypting, validating, verifying, and the like via a hardware element.

As used herein, the term “message” encompasses a wide variety of formats for communicating (e.g., transmitting or receiving) information. A message may include a machine readable aggregation of information such as an XML document, fixed field message, comma separated message, or the like. A message may, in some implementations, include a signal utilized to transmit one or more representations of the information. While recited in the singular, it will be understood that a message may be composed, transmitted, stored, received, etc. in multiple parts.

As used herein, a “user interface” (also referred to as an interactive user interface, a graphical user interface, or a UI) may refer to a network based interface including data fields and/or other controls for receiving input signals or providing electronic information and/or for providing information to the user in response to any received input signals. A UI may be implemented in whole or in part using technologies such as hyper-text mark-up language (HTML), Flash, Java, .net, web services, and rich site summary (RSS). In some implementations, a UI may be included in a stand-alone client (for example, thick client, fat client) configured to communicate (e.g., send or receive data) in accordance with one or more of the aspects described.

Aspects of the disclosure describe pixel processing. Pixels may represent visual information in two dimensions. In some implementations, these aspects may also be applied for voxel processing. Voxels may represent visual information in three dimensions. For example, the physics model may simulate a physics property of an item in a three dimensional space such as experienced via an augmented reality or virtual reality system. Whether processing pixels or voxels, the information represented by the pixel or voxel may include color information, including grayscale or black/white implementations without full-spectrum color. In some implementations, the pixel or voxel may be associated with depth information.

While the above detailed description has shown, described, and pointed out novel features as applied to various embodiments, it can be understood that various omissions, substitutions, and changes in the form and details of the devices or algorithms illustrated can be made without departing from the spirit of the disclosure. As can be recognized, certain embodiments described herein can be embodied within a form that does not provide all of the features and benefits set forth herein, as some features can be used or practiced separately from others. The scope of certain embodiments disclosed herein is indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.

The invention claimed is:
1. A computer-implemented method comprising: under control of a computing device executing specific computer-executable instructions, receiving, from a display device, a request for a content including an item; retrieving, from an item data storage, a first image including the item and a second image including a portion of the item also included in the first image; determining that an amount of the item included in the first image exceeds an amount of the item included in the second image; identifying the first image as an overview image and the second image as a detailed image; detecting a region within the overview image corresponding to the portion of the item included in the detailed image, wherein the detecting is based at least in part on a comparison of first pixel information for the region in the overview image with second pixel information for the detailed image; causing display of the overview image via the display device; and causing display of an interactive control element identifying the region of the overview image via the display device, wherein the interactive control element, when activated, causes display of the detailed image.
2. The computer-implemented method of claim 1, further comprising: detecting an interaction with the interactive control element, the interaction indicating at least one of a type of interaction or a quantity of interaction; determining the interactive control element has been activated based at least in part on a comparison of the type of interaction or quantity of interaction with an activation threshold; and causing display of the detailed image in place of the overview image.
3. The computer-implemented method of claim 1, further comprising: generating a preview image of the detailed image, wherein the dimensions of the preview image are smaller than the dimensions of the detailed image; causing display of the preview image for concurrent viewing with the overview image; detecting an interaction with the preview image; and adjusting the visibility of the interactive control element based at least in part on the interaction.
 4. The computer-implemented method of claim 1, further comprising: receiving the request from an access device; and detecting a functional characteristic of the access device using information included in the request, wherein the functional characteristic is one of: display dimensions, display orientation, an identifier for an application being used for display, or level of a resource for the access device, and wherein retrieving the first image and the second image is based at least in part on the functional characteristic of the access device, wherein the first image and the second image can be received and displayed by the access device associated with the functional characteristic.
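Purely as an illustrative sketch, detecting a functional characteristic from information included in a request might resemble the following. The header names, the DeviceProfile fields, and the specific characteristics parsed are assumptions for this example only; a real access device might report such values through a user-agent string, client hints, or an application-provided field.

    from typing import TypedDict

    class DeviceProfile(TypedDict, total=False):
        display_width: int
        display_height: int
        orientation: str       # "portrait" or "landscape"
        app_id: str            # identifier for the application used for display
        battery_level: float   # one example of a resource level

    def detect_functional_characteristic(request_headers: dict) -> DeviceProfile:
        """Derive a device profile from hypothetical request headers."""
        profile: DeviceProfile = {}
        if "X-Display-Size" in request_headers:            # e.g. "1080x1920"
            w, h = request_headers["X-Display-Size"].split("x")
            profile["display_width"], profile["display_height"] = int(w), int(h)
            profile["orientation"] = "portrait" if int(h) >= int(w) else "landscape"
        if "X-App-Id" in request_headers:
            profile["app_id"] = request_headers["X-App-Id"]
        if "X-Battery-Level" in request_headers:
            profile["battery_level"] = float(request_headers["X-Battery-Level"])
        return profile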
 5. A computer-implemented method comprising: under control of a computing device executing specific computer-executable instructions, receiving, from a display device, a request for a content including an item; retrieving, from an item data storage, a first content including the item and a second content including a portion of the item also shown in the first content; identifying a region in the first content corresponding to the portion of the item included in the second content based at least in part on a comparison of first pixel information for the region in the first content with second pixel information for the second content; and causing display of the first content and an interactive control element for the second content via the display device, wherein the interactive control element for the second content denotes the region in the first content corresponding to the portion of the item included in the second content.
 6. The computer-implemented method of claim 5, wherein receiving the request for a content including the item comprises receiving a message from an access device, the message including information identifying the item, and wherein retrieving the first content and the second content comprises: querying a catalog system using the information identifying the item; and receiving at least one of the first content and the second content in response to the querying.

 7. The computer-implemented method of claim 5, wherein identifying the region in the first content corresponding to the portion of the item included in the second content comprises: identifying pixels in the first content corresponding to the pixels in the second content by comparing an image feature included in the second content with the image feature included in the first content, wherein the image feature is identified by at least one of: color information; color gradient location information; color gradient orientation information; or an object recognized in the first content and the second content.
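As an illustrative sketch only, the feature comparison recited in claim 7 could be performed with keypoint matching. ORB keypoints and descriptors are used here as a stand-in for the gradient location and orientation features mentioned in the claim; the disclosure does not mandate any particular feature detector, and the function name is hypothetical.

    import cv2

    def match_image_features(first_gray, second_gray, max_matches: int = 50):
        """Match keypoint features between two images, pairing pixel
        locations in the first content with locations in the second.
        """
        orb = cv2.ORB_create()
        kp1, des1 = orb.detectAndCompute(first_gray, None)
        kp2, des2 = orb.detectAndCompute(second_gray, None)
        if des1 is None or des2 is None:
            return []
        matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
        matches = sorted(matcher.match(des1, des2), key=lambda m: m.distance)
        # Each match pairs a pixel location in the first image with one in the second.
        return [(kp1[m.queryIdx].pt, kp2[m.trainIdx].pt) for m in matches[:max_matches]]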
 8. The computer-implemented method of claim 5, wherein the computer-implemented method further comprises: detecting an interaction with the interactive control element, the interaction indicating at least one of a type of interaction or a quantity of interaction; determining the interactive control element has been activated based at least in part on a comparison of the type of interaction or quantity of interaction with an activation threshold; and causing display of the second content in place of the first content.
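The activation determination of claim 8 can be illustrated with a simple comparison of interaction type and quantity against a threshold. The interaction kinds and the numeric threshold below are illustrative assumptions, not values taken from the disclosure.

    from dataclasses import dataclass

    @dataclass
    class Interaction:
        kind: str        # e.g. "tap", "hover", "swipe" (illustrative values)
        quantity: float  # e.g. number of taps or hover duration in seconds

    def is_activated(interaction: Interaction,
                     allowed_kinds: frozenset = frozenset({"tap", "swipe"}),
                     quantity_threshold: float = 1.0) -> bool:
        """Compare the type and quantity of an interaction with an activation
        threshold, returning True when the control should be treated as activated.
        """
        return (interaction.kind in allowed_kinds
                and interaction.quantity >= quantity_threshold)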
 9. The computer-implemented method of claim 8, wherein causing display of the second content in place of the first content comprises causing display of a series of images, each image in the series showing less of the item than a previous image in the series, wherein the series of images ends with an image showing the portion of the item shown in the second content.
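One way to produce the series of images recited in claim 9 is a progressive crop-and-resize from the full view toward the detail region, which reads to the viewer as a zoom. This sketch assumes Pillow and a region expressed in overview coordinates; the step count is arbitrary.

    from PIL import Image

    def zoom_sequence(overview: Image.Image, region: tuple, steps: int = 5):
        """Yield frames, each showing less of the item than the last, ending
        with the region corresponding to the second (detailed) content.

        `region` is (left, top, right, bottom) in overview coordinates; each
        frame is resized back to the overview size so the series reads as a zoom.
        """
        full = (0, 0, overview.width, overview.height)
        for step in range(1, steps + 1):
            t = step / steps  # interpolation factor from full view to detail region
            box = tuple(round(f + (r - f) * t) for f, r in zip(full, region))
            yield overview.crop(box).resize(overview.size)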
 10. The computer-implemented method of claim 5, further comprising: generating preview content of the second content, wherein the dimensions of the preview content are smaller than the dimensions of the second content; causing display of the preview content for concurrent viewing with the first content; detecting an interaction with the preview content; and adjusting the visibility of the interactive control element based at least in part on the interaction.
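For image content, generating preview content whose dimensions are smaller than the source, as recited in claim 10, could be as simple as thumbnailing. The maximum edge length used here is an illustrative assumption; the claim only requires that the preview be smaller than the content it previews.

    from PIL import Image

    def make_preview(second_content: Image.Image, max_edge: int = 160) -> Image.Image:
        """Generate preview content smaller than the source content."""
        preview = second_content.copy()
        preview.thumbnail((max_edge, max_edge))  # shrinks in place, preserving aspect ratio
        return preview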
 11. The computer-implemented method of claim 5, further comprising: receiving the request via an access device; and detecting a functional characteristic of the access device, wherein retrieving the first content and the second content is based at least in part on the functional characteristic of the access device.
 12. The computer-implemented method of claim 11, further comprising: retrieving a set of content including the first content, the second content and third content; determining the functional characteristic of the access device is incompatible with the third content; and removing the third content from the set of content.
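Removing content that is incompatible with the access device, as in claim 12, can be sketched as a filter over the retrieved set. The per-item fields (media_type, min_width) and the compatibility rules here are assumptions for illustration, not taken from the disclosure.

    def filter_compatible(content_set: list, profile: dict) -> list:
        """Remove content items that are incompatible with the access device,
        keeping only those the device can receive and display.
        """
        supported = profile.get("supported_media_types", {"image/jpeg", "image/png"})
        width = profile.get("display_width", 0)
        return [c for c in content_set
                if c.get("media_type") in supported
                and c.get("min_width", 0) <= width]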
 13. A system comprising: a data store storing specific computer-executable instructions; and a computing device in communication with the data store, the computing device including a processor that executes the specific computer-executable instructions to cause the system to at least: receive, from a communication device, a request for a content including an item; retrieve, from an item data storage, a first content including the item and a second content including a portion of the item also included in the first content; identify a region in the first content corresponding to the portion of the item included in the second content based at least in part on a comparison of first pixel information for the region in the first content with second pixel information for the second content; and cause display of the first content and an interactive control element for the second content, via the communication device, wherein the interactive control element for the second content denotes the region in the first content.
 14. The system of claim 13, wherein the specific computer-executable instructions causing the system to receive the request for a content including the item comprises instructions causing the system to receive a message from an access device, the message including information identifying the item, and wherein the specific computer-executable instructions cause the system to retrieve the first content and the second content by at least: querying a catalog system using the information identifying the item; and receiving at least one of the first content and the second content in response to the querying.

 15. The system of claim 13, wherein the specific computer-executable instructions cause the system to identify the region in the first content by at least: identifying pixels in the first content corresponding to the pixels in the second content by comparing an image feature included in the second content with the image feature included in the first content, wherein the image feature is identified by at least one of: color information; color gradient location information; color gradient orientation information; or an object recognized in the first content and the second content.
 16. The system of claim 13, wherein the processor executes the specific computer-executable instructions to further cause the system to at least: detect an interaction with the interactive control element, the interaction indicating at least one of a type of interaction or a quantity of interaction; determine the interactive control element has been activated based at least in part on a comparison of the type of interaction or quantity of interaction with an activation threshold; and cause display of the second content in place of the first content.

 17. The system of claim 16, wherein the specific computer-executable instructions cause the system to cause display of the second content in place of the first content by at least causing display of a series of images, each image in the series showing less of the item than a previous image in the series, wherein the series of images ends with an image showing the portion of the item shown in the second content.

 18. The system of claim 13, wherein the processor executes the specific computer-executable instructions to further cause the system to at least: generate preview content of the second content, wherein the dimensions of the preview content are smaller than the dimensions of the second content; cause display of the preview content for concurrent viewing with the first content; detect an interaction with the preview content; and adjust the visibility of the interactive control element based at least in part on the interaction.
 19. The system of claim 13, wherein the processor executes the specific computer-executable instructions to further cause the system to at least: receive the request via an access device; and detect a functional characteristic of the access device, wherein retrieving the first content and the second content is based at least in part on the functional characteristic of the access device.
 20. The system of claim 19, wherein the processor executes the specific computer-executable instructions to further cause the system to at least: retrieve a set of content including the first content, the second content and third content; determine the functional characteristic of the access device is incompatible with the third content; and remove the third content from the set of content.